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Abstract 

We find response time distributions for aperiodic tasks that queue for the same 
server with periodic tasks for which deadlines are guaranteed. The periodic task 
stream is a sequence of tasks with constant time between adjacent periodic arrivals 
and constant service times. The periodic tasks have deadlines which are times by 
which each task must have completed service. Deadlines are equal to the arrival time 
of the next periodic task. Tasks with deadlines are called hard real-time tasks. 

The aperiodic task stream is a sequence of tasks with the time between arrivals 
drawn from an exponential distribution. The service time of each aperiodic task is 
also drawn from an exponential distribution. Aperiodic tasks are served in fifo order 
from within the aperiodic stream. The server will preempt the execution of aperiodic 
tasks to serve periodic tasks and guarantee that every periodic task meets its deadline. 

We study two different aperiodic service disciplines called background aperiodic 
service (BGA) and foreground aperiodic service (FGA). BGA, also known as preemp- 
tive fixed-priority scheduling, assigns high priority to periodic tasks and low priority 
to aperiodic tasks. In BGA, any aperiodic tasks in service at the time of a peri- 
odic task arrival will be preempted so periodic task execution can begin immediately. 
FGA, a special case of the slack stealer, assigns aperiodic tasks the highest priority 
whenever delaying the execution of a periodic task will not result in its deadline being 
missed. 

We develop a collection of aperiodic response time distributions. The response 
times are analyzed separately based on the service discipline (e.g. foreground or 
background). Within each service discipline, several analytic models are identified, 
adapted, and/or derived to characterize the response time distribution of the aperiodic 
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task stream. In some cases, the aperiodic system size distribution is also identified or 
derived- Criteria for model selection is shown to depend on specified and/or observed 
values of the system configuration (e.g. periodic interarrival and compute times, 
aperiodic interarrival and service rates, mean blocking time of aperiodics by periodics, 
etc.). All models and criteria are validated with simulation data. 
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Chapter 1 
Introduction 

1.1 Problem Description 

In this thesis we study the behavior of a single queue with two classes of traffic under 
two different service disciplines. Each traffic class consists of an infinite sequence of 
tasks, each task arriving from some external source and each with its own service 
time. Within a class, the time between the arrival of the n th and (n + 1)** task is 
called an interarrival time. The time required to execute the 71 th task is called a 
service time. Within each class, interarrival times and service times are independent 
of each other and drawn from common distributions. 

For one class of traffic, both the service and interarrival times are constant. 
This traffic class is also called periodic or deterministic since tasks arrive periodically 
(with period H) and the next arrival time is deterministic given any previous arrival 
time. Each newly arriving periodic task requests C time units for its computation 
time. 

The other traffic class is characterized by exponential interarrival and service 
times. This traffic class is called aperiodic, stochastic or random since the time of the 
next arrival is random and not periodic. The mean time between arrivals is denoted 
A" 1 . The mean service time is denoted by fx" 1 . 

A service discipline is a decision rule for selecting which among all queued and 
in-progress tasks to provide service. Only one task can receive service at any given 
time and the service rate is always constant (e.g. 1 sec/sec). 

We impose a rule that we call the periodic task deadline requirement 
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which states that the service discipline must guarantee that all periodic tasks complete 
prior to the arrival of the next periodic task (the next periodic arrival is an implied 
deadline). Such requirements apply to critical task functions, where repeated failure 
to execute within a prescribed time limit can result in catastrophe. For example, 
if our periodic process were a critical feedback control process with rate if" 1 , its 
output must be made available for input prior to the next iteration of the control 
task, otherwise the control function becomes unstable. 

In most control systems, much of the traffic utilization is periodic (e.g. 60%- 
95%), with occasional requests from aperiodic traffic. A typical aperiodic traffic exe- 
cution time (i.e. 1///) might range from 10 to 100 microseconds and periodic process 
periods (H values) often range from 10 milliseconds to several seconds. 1 Integrated 
voice and data applications is another candidate application domain, although the 
traffic classes will have different characteristics. Except for this mention to motive the 
definition and study of this problem, we will not dwell on domains of application. We 
also assume there is no system "benefit" in completing the execution of the periodic 
task prior to its deadline. The product Hp turns out to be a relevant factor in model 
selection. The extreme values for the ranges given are 400 = 40msec/ lOOjisec = 4-10 2 
and 100, 000 = lsec/10/jsec = 10 5 . For critical avionics functions, a plausible range 
is 4 • 10 2 < Hix < 10 5 for current day processor speeds. 

We investigate this mixed stochastic and deterministic queueing model under 
two different but related service disciplines. In both models, the server is preemptive- 
resume. A preemptive server will interrupt an in-service task to start or continue 
service of another task. A preemptive-resume policy means that a preempted task 
will resume from exactly where it left off when it returns to the server. In other 
words, if a task has a service time requirement C, and is preempted at time t having 

1 Execution times will decrease in proportion with an increase in processor speed, unlike periodic 
rates. 
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completed C(t) of its C required units, then upon its first return to service after time 
t, its remaining completion time requirement will be C — C(t). 

In multi-class queues, different priorities are sometimes assigned to different 
classes to give preferential treatment of one traffic class over another. In a preemptive 
discipline, the lower priority tasks are typically preempted to provide service to higher 
priority tasks. The same priority assignment rules apply to all tasks within a class. 
A fixed priority assignment assigns all tasks within a class a single fixed priority that 
changes with neither time nor the state of the queue. Dynamic priority assignments 
may vary as a function of time and/or system state. 

In our problem of study, the periodic traffic is critical, so must have highest 
priority for a period of time minimally long enough to complete service prior to the 
next periodic task arrival. For both service disciplines, we assume that with the 
aperiodic stream, service is in order of arrival, or First-In-First- Out (FIFO). Note 
that periodic tasks (from a single traffic source) cannot queue by the periodic task 
deadline requirement. 

We first investigate a service discipline called background aperiodic ser- 
vice. This service discipline falls under the category of what is known as the preemp- 
tive fixed priority server. Periodic tasks are assigned highest priority and aperiodic 
tasks are assigned lowest priority. If a periodic task arrives when an aperiodic task is 
in-service, the aperiodic task is preempted, and the periodic tasks begins execution 
until it completes, at which point the preempted periodic task resumes service. A 
task running in background has the lowest priority. 

The second service discipline we investigate we call foreground aperiodic 
service. A task running in foreground has the highest priority. However, aperiodic 
tasks are given highest priority only when the periodic task deadline requirement is 
not being violated, so priority assignment is dynamic and dependent on the difference 
defined by the periodic task's deadline minus its remaining service time requirement. 
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Foreground aperiodic priority assignment rules will be made precise in Chapter 6. 

1.2 Problem Illustration 

Figure LI illustrates the processor timelines for background aperiodic service (top) 
and foreground aperiodic service (bottom) when H = 10, C = 4 and the aperiodic 
arrival times (t„'s) and service times (a^'s) are as shown in Table 1.1. The aperiodic 
interarrival times and service times are not exponential, and were chosen primarily 
to illustrate the definitions. 

For background aperiodic service, the periodic task always has highest priority 
and always executes the first four units of every hyperperiod. All aperiodic tasks 
arriving during a periodic task's execution must wait for it to complete before the 
processor will spend time executing aperiodic tasks. Task priorities axe completely 
static under background aperiodic service. The 6^ aperiodic task is preempted at 
time 20 by the (high priority) periodic arrival and resumed at time 24, when the 
periodic task departs. 

For foreground aperiodic service, the periodic task only has the highest priority 
when it will otherwise miss its deadline. In Figure 1.1, aperiodic tasks always have 
highest priority, except when time is 16 units, since the second dispatch of the periodic 
task (which has received no time in the second hyperperiod), must execute four time 
units before time 20. At time 16, the periodic task's dynamic priority is increased 
(or equivalently, the aperiodic tasks' priorities are decreased), so the periodic task 
preempts the currently executing aperiodic task (which is the 6 th aperiodic task to 
arrive). 

Also shown in Table 1.1 are the departure times for the aperiodic tasks, dn^ and 
d n j when scheduled using background and foreground aperiodic service, respectively. 
One performance measure of interest is the aperiodic response time distribution for 
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Background Aperiodic Task Scheduling Timeline 




Foreground Aperiodic Task Scheduling Timeline 

Figure 1.1: Timeline for Example Sample Path; Top = BGA; Bottom = FGA 
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Table 1.1: Sample Path Data for Figure 1.1 

both the background and foreground service disciplines. The sample background and 
foreground aperiodic response time values are also shown in Table 1.1 as r njb = d nj i>-a n 
and r n j = d n j — a n , respectively. The next section elaborates more on measures of 
performance. 

1.3 Thesis Objectives 

Our objective is to model the performance of the aperiodic tasks, where modeling 
of physical phenomena is not equivalent to developing a mathematical anaysis of a 
hypothesized model. We will explore the applicability of existing models, adapt them 
when necessary and possible, and develop approximate models based on analytic and 
empirical techniques to better fit the data. 

The primary performance measure of interest is the response time of a task. A 
task's response time is the time it spends in the system, or equivalently, the instant at 
which the task completes service (and exits the system) minus the instant at which the 
task arrived to the system (and either entered the queue or began service). Figure 1.2 
illustrates the empirical distribution functions constructed from the background and 
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foreground aperiodic response times in the example of Figure 1.1. Based on this 
example, the response time EDF for the foreground aperiodic service discipline is 
better than the response time EDF for the background aperiodic service discipline. 
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background aperiodics 
foreground aperiodics 



Figure 1.2: Response Time EDFs for the Example in Figure 1.1 



If an objective is to produce shorter response times for a randomly observed 
aperiodic task, we will find the foreground aperiodic priority discipline is better. 
We will attempt to answer how much better, and under what conditions. We are 
interested in estimating the response time distribution (in contrast to say the average 
response time), with interest in the right tails of the distribution. We would like to 
be able to predict the probability that a randomly arriving aperiodic task's response 
time is less than some value, or perhaps compute a percentile value. In certain cases, 
we will also look at aperiodic queue length distributions. In all cases, we consider 
steady-state or time-averaged distributions (rather than transient distributions). 
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Even for these (relatively) simple service disciplines and traffic classes de- 
fined by our problem, a (single) precise mathematical analysis appears unachievable. 
In Chapters 2 and 3 we define how queueing models are transformed to stochastic 
processes. Analysis of the stochastic process typically takes one of several forms: (a) 
analytic solution for (easy) special cases and/or expected values, (b) numeric solution 
to state space methods, (c) analytic or numeric solution to approximating processes 
and (d) simulation. We restrict attention to models for describing steady state (or 
time-averaged) behaviour and compare our predictions to simulation results. 

1.4 Thesis Organization 

Chapters 2 through 4 primarily provide background and queueing theory results we 
will use in subsequent developments. Chapter 2 looks at results for a queue with 
a single server with a FIFO service discipline and one traffic stream. Chapter 3 
summarizes well known results for a fixed priority preemptive resume queue using 
FIFO service within each traffic stream. Chapter 4 contrasts techniques introduced 
in Chapters 2 and 3 and establishes the context of their applicability for the analysis 
of our problems. 

Chapters 5 and 6 comprise the development of the analyses for the response 
time (and system size) distributions. We did not find a single analytic model suitable 
for predicting response times, but rather we developed a collection of models and 
criteria for model selection based on the values of system parameter sets. The follow- 
ing categorizations for parameters were found to be useful when developing analytic 
methods for response time evaluation; 

1. Model selection and/or parameterization depends on whether the service disci- 
pline is background (Chapter 5) or foreground (Chapter 6). 
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2. The total system utilization p ranges from light to heavy. We concentrate on 
conditions of heavy traffic (i.e. p « 1), since aperiodic response times under 
these conditions provide a bounding function for lighter loads. However, models 
for lighter total traffic are also explicitly developed under certain circumstances, 
and often exhibit quite different characteristics, p can be roughly characterized 
by light, moderate or heavy. 

3. The periodic and aperiodic utilization, pi and p2 respectively, are varied when 
p is held constant. We consider the two cases when the total traffic is primarily 
aperiodic and periodic. In particular, we consider pi G {0.25, 0.75} to represent 
system behavior over a spectrum of parameter ranges. 

4. The hyperperiod H is the time between periodic arrivals. For simplicity, we 
typically assume the aperiodic service rate \x is one. The ratio of the hyperperiod 
to the average aperiodic service time HfpT 1 = Hp, (= #, when p = 1) turns 
out to be a factor when determining the suitability of various models. H is 
roughly characterized as short, medium and long. 

With the exception of the service discipline, the factor variables are continuous 
valued. We provide approximate model selection criteria based on continuous values 
for the (model selection) factor variables. 

In Chapters 5 and 6, our analyses are validated using a small set of parameters. 
Chapter 7 concludes with future work. Frequently used results are either cited or 
developed in the appendices. 



Chapter 2 

Some Queueing Theory Results 



This chapter introduces some standard notation, typical measures of performance, 
and many well known results for the single server queue with a single source of traffic. 
Numerous books have been written on this subject (e.g. [1], [11], [21], [22], [31], [35], 
and [38]). Many texts on introductory stochastic processes with applications also 
contain chapters on queueing systems, (e.g. [20] and [32]). 

The intent of this chapter is to provide just enough context to readers familiar 
with stochastic processes, but not with their application to queues, for the remainder 
of the thesis to be read stand-alone. Several results that we later use are summarized 
in this chapter (and Appendix A). 

2.1 Queueing Definitions and Notation 

For the single server queue with a single class of traffic, the following notation has 
become standard over the years. The queueing model described above is succinctly 
denoted by A/B/l, where A defines the interarrival time distribution and B describes 
the service time distribution. 1 

The system we consider consists of an unbounded queue and a single server. 
Tasks arrive to the system from a traffic stream originating outside the system. A 
renewal process defines the arrival times for incoming traffic. The time between 

1 More generally and commonly, it is A/B/m/q where m is the number of servers (a single queue), 
and q is a bound on queue size. We consider only m = 1, and q = oo, where the convention is to 
omit q. A convenient mnemonic for B when considering high priority traffic execution times is a 
denotation for blocking times, 

10 



CHAPTER 2. SOME QUEUEING THEORY RESULTS 



11 



adjacent arrivals is defined by the (common) interarrival distribution A(t) = P(T < 
t). A sequence of interarrival times {Ti,T 2 , 13, ...} determines the arrival time of r n , 
the n th task, is A n = J^ ml Tj. 

Each job adds a certain amount of work to the system which is defined by 
its service time. Let {Xi,X 2 ,X 3j ...} be an independent and identically distributed 
sample, drawn from a common service time distribution B(x) = P(X < x). In other 
words, the n th job to arrive to the system has service time X n and, at the instant 
of arrival A n , increases the system work by X n . Service times and arrival times are 
independent of each other. 

The process state variables of interest for the n th task, r n to arrive to the 
system are denoted by: 

A n = arrival time of r n 

T n = A n - An-i = n th interarrival time , T n ~ A V n (2.1) 
X n = the service time for r ny X n ~ BV n. 

A system with no tasks is said to be empty and the server will be zcf/e, otherwise 
the server is (required to be) busy processing some job. These are sometimes called 
work conserving systems, where a system cannot remain idle if there is work to 
be done, nor can it produce work (at any time). It can be shown that the single 
server work conserving queue always has a limiting distribution whenever the average 
interarrival rate is less than the average service rate ([20]). 

Some specific interarrival and service time distribution we use are exponen- 
tial, deterministic, and gamma distributions are denoted by £{Q), T>(c), and G(a,0), 
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respectively. To summarize distributional notation, 



For X ~ i 



D(c) 
6(6) 
G(a,0) 
V(X) 



P[X 

fix) 

/(*) 
P[X 



c] = 1 where c is constant 
de- 0x [x > 0] 
af > x fi - l e- ttX [x > 0] 
k] = £e- x [k € {0,1,2,...}] 



(2.2) 



We also use the notation [X e B} 2 to denote the indicator or Bernoulli random 
variable taking on the value 1 if the value of X lies in a set B and zero otherwise. 
As is customary, we abbreviate the phrase "independent and identically distributed" 
with iid. 

The scheduling discipline defines the decision rules the processor uses when 
selecting which among the currently queued jobs to process. Unless otherwise stated, 
we consider only the First-In-First-Out (FIFO) scheduling discipline, where jobs are 
processed to completion in the order in which they arrrive to the system. Two pro- 
cesses of frequent interest are the queue length process and the virtual waiting time 
process. The waiting time is the time a job spents in the queue before it reaches 
the server (for the first time). The virtual waiting time (defined at a point t) is the 
waiting time a job would experience if it arrived at time t 

Sometimes it is easier or more informative to consider two closely related 
processes, the response or system time and the system size which is the number of 
tasks in the system. As one might readily guess, the number in the system is zero 
when the server is idle, otherwise it equals the number in the queue plus one. The 
response time is the time spent in the queue plus the time spent in service. When 
a service is uninterrupted, the response time is then the convolution of the waiting 

2 Donald Knuth adopted this notation, which we find convenient when expressing integrals. For 
example, Pr(X e B) = J Q [X(u>) 6 B] <L>. 



CHAPTER 2. SOME QUEUEING THEORY RESULTS 



13 



time and the service time. When service times are interrupted (such as in priority 
queueing or in some non-FIFO service disciplines), the response time distribution is 
more difficult to obtain, and often has no known analytic solution. 

When refering to limiting distributions, we denote response times, waiting 
times, queue lengths, and number in the system by W, and N, respectively. For 
example, Q{q) = P(Q < q). Q and N are non-negative discrete random variables, 
R is non-negative and continuous, and W is non-negative, and everywhere contin- 
uous except at 0 where W^O) > 0. The notation we use for derived steady state 
distributions, their parameters and moments are summarized in Table A.l. 

It is common practice to use the same mnemonics to refer to state variables 
describing the process as it evolves over time. The context will (hopefully) make 
clear the meaning. For example, Q(t) = Q t is the number in the queue at time t 
To complicate things slightly, we adopt the common notation W(t) = W t to mean 
the work in the system at time t. In fact, W(t) is the virtual waiting time for a 
FIFO queue, which is the waiting time if an arrival were to occur at time t. We also 
sometimes use the notation A(t) = A t and D(t) = D t to represent the number of 
arrivals and departures in [0,£], respectively. Again, the context should make clear 
the meaning. State variables are summarized in Table A. 2. 

Within a single task stream, we consider only non-preemptive disciplines. For a 
single server work conserving queue, both the number in the system and the number 
queued are invariant under different service disciplines. When choosing to start a 
new task, selecting an arbitrary task is equivalent to selecting the task at the head 
of the queue, since the execution times are iid. Thus, queue length distributions are 
also of interest since they can give measures of system saturation independent of the 
service discipline. For example, the probability the system is idle is the probability 
the system contains no customers. In some cases we will develop estimates for queue 
length distributions. For the FIFO discipline, the relationship between waiting time 
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and the number in the system is relatively straightforward. 



2.2 The M/M/l Queue 



For the M/M/l queueing model, both A and B are exponential, with interarrival and 
service rates A and respectively. The M stands for Markov. So A(t) ~ £ (A) and 
B(x) ~ £{/j,). Steady state solutions for the distributions of R,W,Q,N, and many 
other quantities all have analytic closed form solutions in the M/M/l queue and can 
be found in many introductory texts (e.g. [20], [21], [38]). We develop just a few of 
the results we use, then cite and summarize others in Table B.l. Transient results 
require greater analytic machinery, but many results are known ([35]). 

When solving for N, system state is simply the number of tasks. The steady 
state distribution of N is found by solving the stationary equations n = ttN, with 
boundary condition Tre = 1, shown in Equation 2.3. 



Writing system utilization as p = A//z, then solution to Equation 2.3 is easily calcu- 
lated and given in Equation 2.4 



Xn k + /zn* = tm k +i + An^i for k G {1, 2, 3, ....} 

Ano = im x with 

SfcLo n * = 1 38 boundary conditions 



(2.3) 



n k = P(N = k)=p k (l-(>). 



(2.4) 



For FIFO queues, the response time distribution of the M/M/l queue is directly 
caclulable once recalling that the sum of k random variables, each £(/z), is ~ Q{fx> k) 
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and given in Equation 2.5. 



P(R < x) = Er=i p (R < x\N = k)n k -i 



(2-5) 



= 1 — e~k*~ A K 



Two things are worth noting about Equation 2.5. First, it is only valid for the FIFO 
service discipline. 3 Second, it is correct to condition on queue length for a newly 
arriving task since Poisson Arrivals See Time Averages, a fact we will find useful. 4 
This property is more tersely referred to as the PASTA property, and it applies quite 
generally. A precise statement of the PASTA property can be found in Lemma C.2.L 
It is often possible to explicitly calculate first moments of first passage times 
for the M/M/l queue. One value of interest is the mean time to first transition from 
state k + 1 to state A:, for k € {0, 1, 2, ...}, which we denote by E[k + 1 -» ft]. It can 
be shown [19] that 



An application of Equation 2.6 is the calculation of the expected time of a busy 
interval. Suppose a task arrives at time t to an empty queue, at which point the 
process state is one. The busy interval ends when process state first reaches zero 
after time t Equation 2.6 tells us that E[l -» 0] = (/x - A)~ l . It is interesting to 
note that the mean response time is equal to the mean time of a busy interval. The 
Markov property of the M/M/l queue gives us a generalization to Equation 2.6 shown 

3 In fact, for non-FIFO service disciplines, the response time distributions are generally not known 
for even for the M/M/l queue. 

4 For example, the PASTA property implies that the virtual waiting time is the waiting time for 
any Poisson arrival process. 



E[k + 1 -> k] = (fi - A) 



- 1 for k E {0,1,2,....}. 



(2.6) 
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in Equation 2.7. 

E[k + m -)• k] = m(fi - A)" 1 for k € {0,1,2,....} and m € {1,2,3,...}. 

(2.7) 

Much more is known about the M/M/l queue, and more generally about 
continuous-time birth-death Markov processes. For example, the distributions of the 
queue length, the number served in a busy interval, and of the busy and idle period 
durations are also known. Results we use are summarized in Table B.l for easy 
reference. 

2.3 The M/G/l Queue 

In the M/G/l queue, the arrival process is Poisson, but the service times can be 
general. G stands for general (i.e. G could be any off, Q, V or other distribution). 
For the M/G/l queue, the interarrival time distribution is £(A) and the service time 
distribution B has mean ra& and variance a\ < oo. The results cited below are well 
known and can be found in most texts (e.g. [20], [21], [11]). 

The M/G/l queue is of interest to our problem, since a priority queue can 
often be approximated by reducing it to an M/G/l queue using the concept of a 
completion time. A completion time for a task r is measured as the time from the 
first instance r receives service to the time at which r completes. Completion times 
and their use are described more in Section 3.2. 

The M/G/l queue is a semi-Markov process, containing an imbedded Markov 
Chain for the number of customers in the system defined at task departure instants. 
This is the result of Lemma 2,3.1 

Lemma 2,3.1 (An M/G/l Imbedded Markov Chain.) Since the service time 
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distribution is not exponential, transition probabilities will depend on the time a task 
has been in service. So, any process defining system state must contain information 
equivalent to time since start of service. For example, [N($),Xq(s)], where X Q (s) = 
the time current task has been in service, is a continuous state Markov process (with 
sample paths that jump). 

Consider the set of points at which departures occur, say $ lf $ 2 , £ 3) .... At these 
points, Xo(s n ) = 0, making {N(s n )} a sample realization of a Markov chain. 

Let Vk = the number of arrivals to the system during the execution of task 
Tfc. 5 The defining equations for the number in system in the imbedded Markov chain 
developed in Lemma 2.3.1 is given by Equation 2.8 



N k + v k+l -l forjV fc >0 
Ufc+i otherwise 



Using the relationship in Equation 2.8 and moment generating functions, 
results for the M/G/l queue are in terms of the service time LaPlace transform, 
B(s) = E[e~ 8X ]> where X is the service time random variable with distribution B. 
The LaPlace transform of the number in the system is given by 

N(z) = mi-z)) £~ P){1 ~ Z) • (2-9) 

The development of Equation 2.9 was based only on departure instants. However, an- 
other important result holds for the M/G/l queue which we highlight in Lemma 2.3.2. 



5 The density of v k is simply P{v h = m) = / 0 °° ^^e^dBix), for m 6 {0,1,2,...} and k e 
{1,2,3,....}. 
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Lemma 2.3.2 (Departures See Time Averages in the M/G/l Queue) In the 

limit, the number of times the imbedded Markov departure time process transitions 
from k to k + 1 differs by no more than one from the number of transitions from 
k + \ tok. So given the PASTA property for arrivals, one might expect it to also hold 
for departures. For a simple but more formal and convincing argument, see [11] (pp. 
221). 

Application of Lemma 2.3,2 renders Equation 2.9 valid for all time points. 
Inverting Equation 2.9 remains. This is sometimes possible by inspection (e.g. N 
for the M/D/l). In theory, numeric inversion is an option when the service time 
distribution has a density with respect to Lebesgue measure. 6 However in practice, 
only moments are usually sought. 

Under the assumption of FIFO service, the LaPlace transform for the response 
time (which is the product of the transforms for the waiting time and service time) 
can also be found and is represented in Equation 2.10 

R(s) = B(s) . (2.10) 
$ - A + XB(s) 

Much more is known about the M/G/l in steady state. Its transient behavior is 
treated in detail in [35]. 



2.4 Numeric Methods 

Most practical queueing problems are not described by stochastic processes that sub- 
mit nicely to analytic solutions. Instead, they are solved using one or more of a 
variety of numeric methods. Beyond this section, we won't focus on numeric solution 

6 Of course, a density can be approximated to an arbitrary degree of precision by using very steep 
slopes about points of discontinuity. 
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methods since their solution rarely provides insight into the nature of the problem, 
and inferences can rarely be made beyond the particular parameter settings for which 
the problem was solved. 

We will identify a handful of properties that can greatly complicate the prospects 
of analytic solution. We will later see that our queueing models exhibit several of 
these complicating properties. We also mention a few of the more common solution 
techniques, some of which were used in Section 2.3. 

When a queueing model is non-Markovian, efforts to somehow convert it to a 
Markov process for which known results apply are often tried. Among these conversion 
techniques are approximation by diffusion processes, the method of supplementary 
variables, the method of imbedded Markov chains and the method of stages. 

Examples of using diffusion approximations for queue length and virtual wait- 
ing time processes for the G/G/l queue (near saturation i.e. p = A(ju)~ l « 1) are 
developed in Section 2.5. In Section 2.3, the number in the system process, N(i) was 
first converted to a continuous state Markov process by adding a new state variable 
in the process state description. Now N(t) = [N(t),X 0 (t)] represents the (system 
size) process state at time t. Increasing the dimensionality of the state vector with 
additional information so the relevant past information is contained in the current 
state is known as the method of supplementary variables. 

[N(t) } Xo(t)\ is a continuous-state Markov process (when B(x) is absolutely 
continuous), and solving continuous-state Markov processes is still a formidable task. 
The method of imbedded Markov chains involves looking at the process only at points 
defining an imbedded Markov chain. In the M/G/l queue, we saw that departure 
instants formed one set of time points defining an imbedded Markov chain. Similarly, 
for the G/G/l queue, the start of idle intervals defines a set of imbedded points ([36]). 
The behavior at these imbedded chains may, but typically will not, be the same as 
overall system behavior. 
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Sometimes states are introduced to approximate a system of interest. For 
example, a constant random variable X ~ V(D) can be approximated by a gamma 
random variable G ~ Q{a = k } (3 = D{kfj)) 1 when letting k — ► oo. To model 
a deterministic execution time, a task's execution consists of k sequential identical 
phases, where each phase is exponentially distributed with rate /z. As k increases, 
the approximation improves. This approximation example illustrates what is known 
as the method of stages. The state space remains discrete, but the state description 
now contains a new state variable (i.e. a supplementary variable) describing the 
stage of execution. The original continuous state process has been approximated by 
a (potentially large) discrete state Markov process. 

Using discrete state methods to approximate an inherently continuous phe- 
nomena (such as response times) is likely to result in a very large state space, where 
numeric solution is required. There are a number of available numeric solution tech- 
niques (but numeric instability is not uncommon) for solving (possibly very large) 
sets of linear equations, which can be used for finding the stationary distribution of 
the Markov chain. Assessing the quality of the approximation (to the original model) 
is yet another problem that also need not lend itself to analysis. 

An alternative approach is to use a continuous state process to define the 
queueing model. All the queueing processes we have considered so far have renewal 
processes that form the input flows, and when the server is busy for extended peri- 
ods of time, the output flows. The interarrival and service time distributions have 
depended on neither time nor system state. When we consider the background and 
foreground priority server models as a reduction to an M/G/l queue, the completion 
times will depend on both time of arrival and on system state. 

When the process is non-Markovian (which it is in our problem), there are 
no general solutions for performance distributions. The best we can hope for is 
special case analyses or approximation by continuous state Markov processes, which 
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constitutes much of the material in Section 2.5. 

2.5 The G/G/l Queue 

For the G/G/l queue, both the interarrival time and service time distributions are 
arbitrary. When the service time distribution is exponential, the model is a G/M/l 
queue for which many analytic results have been found. However, when using com- 
pletion times to approximate a preemptive priority queue, the distribution for the 
completion time is not approximately exponential, so the many known results for 
G/M/l model are not useful to us. Without further restrictions on either the in- 
terarrival time or service time distributions, not even first moments of performance 
measures need have (known) closed form solutions. 

2.5.1 G/G/l Process Formulation 

A direct and commonly presented representation for the G/G/l queue is Lindley's in- 
tegral equation (also known as a Weiner-Hopf type equation). An analysis of Lindley's 
integral equation requires analytic machinery well beyond the scope of this (student 
and her) thesis. Nonetheless, a sketch of the development of Lindley's integral equa- 
tion proves worthwhile since it forms the basis for a compact representation of the 
queueing models we will study when the hyperperiods are intermediate in length. 
The development presented here largely parallels that given in [21]. 

The development of Lindley's equation applies to waiting times, the time a 
task spends in the queue before starting service. In addition to the notation in 
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Equation 2.1, we make use of the following defining relationships: 

U n — X n — T n+ i = system change in work by r n relative to A n+ i 

W n = time r n spent waiting before starting service 

C n (u) = Pv[U n <u] 

Wn{y) =Pr[W n <y] (2.11) 

Wn+r = max[0,W n + U n ] = (W n + U n )+ 

It is the last relationship in Equation 2.11 that we use as a basis for our process 
simulation model. Here is one possible approach to numerically solving for the limiting 
waiting time distribution. By definition, 

C n (u) = f°° P[X n < u + *|T n+1 = t]dA(t). 

Jt=0 

Noting that X n is independent of T n+1) it follows that 

poo 

C n (u) = C{u) = B(u + t)dA{t). 
Jt=o 
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Also, for y > 0 and W n+l (0~) = 0, 



W n+l (y) 



= J™P[U n <y-x\W n = x] 



■]dW n (x) 



by definition of W n +i 



= J£C n (y-x)dW n (x) 
by independence of U n and W n , so 



W(y) 



= £C(y-x)<W(x) 



(2.12) 



by Equation 2.11 and def of lim^ooW, 



n 



= -/ 0 °f^(x)dC(y-x) 



by integration by parts 



= IlooWiy - x)dC(x) 



by a change of variables. 



The three relationships defining W(y) in Equation 2.12 are different forms of Lindley's 
integral equation, a type of Weiner-Hopf equation. Not even an explicit expression 
for W, the mean wait is known without making some assumptions about the general 
service and interarrival distributions. 

It may be worth noting the PASTA property need not hold for the G/G/l 
queue. For example, consider the D/D/l queue where the number of tasks an arrival 
sees in the system (not counting the arriving task) is zero with probability one. If 
we were to pick a random time and then observe the number of tasks in the D/D/l 
queueing system, we would see one task with probability C/H and zero tasks with 
probability 1 — C/H (which is the distribution an arrival would see if the PASTA 
property were applicable). 

As an alternative to an exact analysis, the queueing process can sometimes 
be well approximated by a simpler process, for which analytic solution is possible. 
The remainder of this chapter discusses diffusion approximations for queue length 
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and waiting time distributions. 

2-5.2 G/G/l Heavy Traffic Approximations 

In certain eases, diffusion approximations have been shown to provide reasonable 
approximation of queueing models under heavy traffic conditions. Under heavy traffic 
conditions, the system is near saturation (i.e. p = Xf/a « 1). Figure 2.1 is an 
illustration of the plausibility of this approximation, which shows the queue length 
process of an M/M/l queue as it evolves over time. Note that the horizontal time 
axis, with t 6 [0, oo), acts as a reflecting barrier since queue lengths never go negative. 



Qutut L»n0tH as a Function of Tim*; utll =s O.ddO 
©O t — ■ — ■ — i 1 1 "i r 




x— axis «=* Time 



Figure 2.1: Queue Length Sample Path in Heavy Traffic 

Steady state solutions for approximating the G/G/l (and much more general 
queueing structures) have been found when the diffusion coefficients are constant (i.e. 
independent of state and time). 

A common strategy for studying the G/G/l queue in conditions of heavy traf- 
fic is to appropriately scale time (t -> [t/n]) and space (e.g. queue size, q -> tf(n~ 1//2 )) 
in the original queueing system, creating a sequence of random functions that con- 
verge weakly to a Brownian motion. Donsker's theorem is included in Section C.4 and 
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provides the underlying mathematical machinery needed to obtain the weak conver- 
gence results. To obtain the equilibrium distribution (when one exists), the limiting 
diffusion process needs to be evaluated as t — ► oo. 

Whitt ([37]) appears to have been the first to find diffusion approximations 
using weak convergence limit theorems for a preemptive fixed priority queue. Burman 
([5]) used the method of supplementary variables to find a diffusion approximation for 
the G/G/l queue length process. Harrison ([13]) presents a comprehensive treatment 
for finding solutions to networks of queues with multiple traffic classes (but none 
have priorities), and an arbitrary routing structure. Harrison ([14]) also presents an 
analysis for the transient solution of a Brownian motion with a reflecting barrier at 
the origin, which has applicability to modeling transient heavy traffic behavior of the 
G/G/l queue. 

Solutions for diffusion processes with time and state dependent coefficients 
requires solution techniques for stochastic partial differential equations, which some- 
times can be solved using the Ito calculus ([2]), but typically can only be numerically 
estimated. 7 These solution techniques are well beyond the scope of this (student and 
her) thesis. The literature on the subject is enormous but results for applications of 
even modest complexity (such as those in this thesis) are mathematically inaccessible 
to most and appear largely unconsolidated when they exist. 

Later our simulation studies will reveal that the preemptive fixed priority 
diffusion approximation is often excessively optimistic when the mean aperiodic and 
periodic service times are not of the same order of magnitude. However, we will find 
cases where a diffusion process is a reasonable estimator of blocking times, so it is 
instructive to present the results, and sketch some of the development. 

7 The Ito calculus is also applicable for stochastic differential equations with constant coefficients. 
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2.5.3 Approximating the System Size Process 

Define processes 

A t = number of arrivals in [0, t] 

T t = amount of time spent busy in [0, t] 

D t = number of departures from an ever busy server in [0, T t ] and 

(2.13) 

D t = number of departures in [0,tj. 

A u D u and D t are counting processes. T t has been called an allocation process and it 
defines the amount of time allocated to serving processes. By definition, N t — A t -D t . 
We call N t the system size process to distinguish it from the queue length process, 
Q t . Some authors refer to both N t and Q t as the queue length process, providing 
context as a means for differentiation. 

We now compute the coefficients for the limiting diffusion process. The asymp- 
totic means and variances of A t and D t are easily computed, since they are count- 
ing processes of renewal processes. Specifically, as t -» oo, E[A t ]{t)~ l = A and 
E[D t ]{T t )' x = ju. An application of the CUT for renewal processes (see Lemma C.l.l) 
gives Var(^)(0" 1 = °l>? and Vw(D t )(T t )- 1 = a 2 bf x\ 

Under conditions of heavy traffic, the server is usually busy, so T t « t and 
D t « D t . Analogously, we approximate N t by A t - D t . Further, since some queueing 
usually occurs, the process D t is largely independent of the process A t . This is in 
contrast to a lightly loaded server, where the departure process is highly dependent 
on the arrival process. 

Assuming independence of A t and D t > the diffusion coefficients for process N t 
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are now defined by a simple calculation, 



m N = E[N t ](t)- x = (A - m)> and 

*%= Var^K*)" 1 = (Var^ + VarTO)^ 1 ^ (^A 3 +a 6 V). 



(2.14) 



The reader unfamiliar with weak convergence arguments is encouraged to first 
review Section C.4, which also provides some notation. It turns out that ([2], [20], [22]) 
in the limiting process, the transitions are defined by the Kolmogorov forward diffu- 
sion equations (also called the Fokker-Planck equations) in Equation 2.15. 



Fortunately, Equation 2.15 has been solved under a number of boundary conditions 
when m(x,t) = m = m N and cr 2 (a;,t) — a 2 = a%. When \ > fj, the queue grows 
without bound, and no proper steady state queue length distribution exists. However, 
for A > /i, we might want to consider the queue length after some long period of time. 
In this case, we have 



We will find application for Equation 2.16 for describing aperiodic queue length at 
the end of a blocking time under the aperiodic background priority server discipline. 
The diffusion coefficients will, of course be different. 

When A < /x, the boundary condition p(x, t\x 0 ) = 0 for x < 0, t > 0, and 
x 0 > 0 corresponds to a reflecting barrier at the origin (queue lengths and waiting 
times are never negative and reflect when reaching the origin). Under these conditions, 



dp(x,t\xo) _ d 
. dt dx 




(2.15) 




(2.16) 
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the solution to Equation 2.15 is given by 

P(N°° < x) = 1 - e 2mx/ " 3 for x > 0 so iV 00 - £ (2ma" 2 ). (2.17) 

In the special case of the M/M/l queue, m = (A — /x) and a 2 = (A + /^). For 
A < ju, Equation 2.17 applies giving 

Pr(iV < *) = 1 - C -^ A )^" 1 * for k > 0. (2.18) 

Since we are approximating a discrete random variable by a continuous one, 
Equation 2.18 will differ from an exact solution, which for the M/M/l queue is 

Pr(N < k) = 1 - p k ^ = 1 - pp k . (2.19) 

When p « 1, it suffices to show that e - 2 (^-^)(M+A)- 1 w ^ To see t j iat t jj e ex p ress i on 
in Equations 2.18 reasonably approximates the value in Equation 2.19, take fi = 1 
and the first two terms in the Taylor's expansion of e- 2 ( 1 ~ A )( 1+A )" l J giving 1 — 2(1 — 
A)(l + A)" 1 « A, or (1 — A) 2 ^ 0 which follows from the assumption A « = 1. 

2.5-4 Approximating the Virtual Waiting Time Process 

To calculate the virtual waiting time of a FIFO queue, we introduce several new 
process definitions using variables and processes previously defined in Equations 2.1 
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and 2.13. Let 

Xt = xi + X2 + ... + xa % = cumulative work requested in [0, t] 
Y t = X t -t 

It = — info<*<tK* = cumulative idle time in [0, t] 

T t = t~- I t = cumulative busy time in [0, t] and (2.20) 

W t = X t — T t = pending work at time t. 

The process definitions in Equation 2.20 apply in much greater generality than we have 
considered. Assume the queue begins empty, so Wq = 0 and the number of departures 
cannot exceed the number of arrivals, in which case W t = Xpi+i + XD t +z + »• + #a*- 
Letting t oo 

m w = lim^oo^^Kt)- 1 = (E[Eto t+ x ^K*)" 1 

= {E[Dt + i] _ = (1 - p ) + Um^fot)- 1 

= (1 - p), (2.21) 

where the second line in Equation 2.21 follows from Wald's identity and p= X/p. 
Now for n € {1,2,3, ...} define sequences of processes: 

W n (t) = \W{nt) - (p - l)nt}n- 1 ' 2 , 
An(t) = [A(nt) - \nt)n- 1 ' 2 and (2.22) 
S n (t) = Efi(^ " »- x )]n-"\ 

The following weak convergence results have been shown ([37]). 



S n => S where 5 ~ cr 6 f 1 
A n => A where A ~ (A 3 ©*)" 1 ' 2 ^ 8 , and 



(2.23) 
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where f 1 and £ 2 are independent Wiener processes. (See Section C.3.) From Equa- 
tion 2.23, the variance is easily calculated. Vax(W n ) = AVarS + ^"~ 2 VarA = Aof + 
//~ 2 A 3 <7 2 . Summarizing, the diffusion coefficients for the waiting time are 

m w = (p- 1) and a 2 w = Aa 2 + //- 2 A 3 a 2 . (2.24) 

For mw < 0, the limiting distribution of W°° is found using Equation 2.17 
with coefficients given in Equation 2.24. Equation 2.25 gives the general case, 

Pv{W°° < t) = 1 - e-W-PX^l+^vl)- 1 * for t > 0 . (2.25) 

For the special case of the M/M/l queue, the limiting virtual waiting time 
distribution becomes Equation 2.26. 

Pv(W°° < t) = 1 - e - (1 "^ 2A " lt for t > 0. (2.26) 

The exact waiting time distribution for the M/M/l queue is given by 1 - pe~b*->)*. 
Under conditions of heavy traffic, p w 1, so 1 — pe~^~ x ^ x « 1 — e"^ M ~ A ^^ A ^, which 
is Equation 2.26. 



Chapter 3 
Priority Queues 



In this chapter we introduce notation and modeling constructs helpful in the analysis 
of priority queues. We also define typical measures of performance for priority queues, 
and give several known results for the single server queue with multiple sources of 
traffic. Results that we use later are included in this chapter for future reference. 

3.1 Performance Measures and Notation 

Our system now consists of a single server with r traffic classes, each originating from 
a distinct traffic stream outside the system. In our case, r = 2, but many of the 
concepts generalize. In addition to the server, there is queueing. One can think of 
either a single queue that is sorted by priority, or of r distinct queues where jobs of 
class j queue in the j th queue. We have tried to consistently denote periodic subscripts 
by 1 and aperiodic subscripts by 2. 

Definitions for parameters and the limiting performance distributions intro- 
duced here are summarized in Table A.l. The notation conventions introduced in 
Chapter 2.1 are extended to include a subscript for the traffic class. For example, 
R c would refer to the steady state response time distribution for class c tasks. For 
X a random variable, or F a distribution function, we use the notation X ~ Y and 
Y ~ F, when X = Y a.s. or when Y has distribution F, respectively. 

Each job class j € {l,...,r} has an interarrival time distribution, Aj(x) = 
Pr(Ti\n < x), n € {2,3,...} where T j)n is the time between the (n — l) st arrival of 
class j and the n ih arrival of class j. The arrival process for each class then forms 
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a renewal process. Since the only classes of interarrival processes we consider are 
either exponential, or deterministic (starting the system with a deterministic arrival 
at zero), we may assume that the first interarrival time distribution is equal to those of 
ail subsequent interarrival times. Since we are interested in equilibrium distributions, 
it turns out that we may make the same assumption for more general arrival processes. 
All arrival processes are assumed to be independent. 

The mean time between T^ n and 2j >n+1 is denoted by Xj l . Xj is called the 
arrival rate for class j. A counting process NAj(t), counting the number of class j 
arrivals in (0, t) is used when defining the queue length and other processes. 

Recall (from the Introduction) that the scheduling discipline defines the deci- 
sion rules the processor uses when selecting which among the currently queued jobs 
to process. For priority queues, when a departure occurs, the processor will always 
select the job with the highest priority to start processing next. A preemption occurs 
when the processor is serving a class j job class and a job with higher priority arrives 
or otherwise becomes available. Note that preemptive schedulers need not provide 
uninterrupted service to jobs. Job priorities can be either fixed or dynamic. A fixed 
priority does not change with time, whereas a dynamic priority does. The background 
and foreground aperiodic service disciplines are examples of fixed and dynamic priority 
scheduling disciplines, respectively. 

Similarly, each job class j G {1, r} has a service time distribution Bj(x) = 
Pr(Xj tTl < x), n € {1,2, ...}. The service time distributions are also assumed to be 
independent of each other and of the interarrival processes. The mean service time 
(in the absence of interruption) for class j is defined by mj. The mean service rate 
(in the absense of interruptions) for class j is fZj = m" 1 . An analogous counting 
process for completed services N Dij (t) is defined by the number of class j departures 
(or completed class j services that have occurred in (0,t)). 
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Unless otherwise stated, we consider only the First-In-First-Out (FIFO) schedul- 
ing discipline within each job class. In other words, a class j job, once started will 
be completed before any other class j job is started. And they will be started in the 
same order in which they arrived. 

The notation used to describe the process' state variables is summarized in 
Table A.2. Again note that many of the same symbols (with perhaps different sub- 
scipts) are used to define limiting distributions as state variables. The processes of 
typical interest are once again the queue length process (or system size) extended in 
the natural way to Q(t) = (Qi{t),Q2(t)i — >QrW), and the virtual workload process 
W(t) = (Wi(i), VK 2 (i), W r {t)). An unsubscripted variable might also denote a sys- 
tem variable. For example, Q(t) = Qi(t) + Q 2 {t) is the total number of tasks queued 
at time i. 

When priorities are fixed, all the results in Chapter 2 apply to jobs of class 
1 (i.e. jobs with the highest priority), in which case interest is focused on the lower 
priority classes. For jobs of class k > 1, the class k virtual workload differs from the 
class k virtual waiting time, since in a preemptive priority discipline all jobs of class 
j < k will receive service prior to providing service to any jobs of class fc. Also, the 
execution of class k jobs will be interrupted by any new arrivals of class j < k. In 
other words, Wk is the virtual waiting time a class k job would see only if Wj = 0 for 
all j < k and there are no new arrivals for any class j < k while the queue contains 
class k jobs. 

3.2 Completion Times 

When service within each stream is FIFO, the concept of a completion time can be 
used to (approximately) transform the analysis of a preemptive priority queue to an 
analysis of a queue with only a single traffic class. When the arrivals are Poisson in 
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the stream of interest, the priority queue can be studied as an M/G/l queue. The 
completion time of a task is defined to be the time between the first and last instants 
of execution. In other words, the start of the completion time begins when a task 
first transitions from waiting to service. In a preemptive priority queue, tasks can be 
preempted by higher priority tasks. The execution time of these higher priority tasks 
contributes to the completion time of the task being preempted. The completion time 
ends when the task exists the system. 

Equation 3.1 shows the probability of task type, when considering a task se- 
lected at random. 

P(periodic arrival) = and ^ ^ 

P (aperiodic arrival) = A ^ 

One case of applicability for modeling mixed periodic and random traffic is when ^ 2 > 
Hi and 0 Co < pi < 1 — Co, for Co some non-negligible portion of 1. In other words, 
when C = mi » m<i = 1> typically) and Cq < CH~~ l < 1 — cq a completion 

time will rarely contain more than one periodic execution time. In order for this to 
occur, we must have some X 2 ,n > (H — C) which occurs with probability e~^ H ' c \ 
For example, suppose 0.25 < pi < 0.75 and C > 12ra 2 ( = 12 • 1 when /x 2 = !)• Then, 
2.3 - 10" 16 < e-wW-O < 0.0184. 1 

When queue lengths are typically long, and a completion time contains only 
one blocking time, then when blocking occurs it will often be for the full blocking 
time B of the periodic traffic class. In the background aperiodic server discipline, 
B = C. In the foreground aperiodic server discipline, B < C, and sometimes it is 
much less. Let X be a random variable with the distribution of an aperiodic task's 
completion time. Under these conditions, the approximate mean and variance of X 



x When a completion time routinely contains multiple complete periodic compute times, a de- 
graded server model (described in Chapter 4) works reasonably well. 
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are given in Equation 3.2 where A = Ai + A2. 

4- AM = mi 4- 4*777.0 and 

(3.2) 



E[X] « ^#[*i] + + X 2 ] = mi + ^m 2 and 



3-3 Poisson Arrival Processes 

In all literature reviewed, when developing an exact analysis for priority queues, a 
number of commonly made assumptions were used for tractability. First, all input 
streams were assumed Poisson (each with a constant arrival rate). 2 Second, priorities 
are usually fixed (not dynamic) and within each class, service is typically FIFO. Third, 
service times are independent of time, system state, and of the arrival processes. Also, 
the service time density is often assumed to be absolutely continuous with respect to 
Lebesgue measure. Many of these assumptions can be relaxed when going to heavy 
traffic approximations, but are necessary when considering an exact model if any 
tractable solution is to ensue. 

The past 30 years have been explosive in terms of developing new models for 
networking behavior (which can sometimes be modeled as either processor sharing 
or stages of servers). Priority queues have been found to be useful for efficiently 
scheduling periodic tasks with hard deadlines ([27], [23]). In recent years, considerably 
less effort appears to have been expended on finding exact probabalistic solutions for 
priority queues. Using either approximating models (such as heavy traffic) for which 
analysis is possible, or applying numeric methods for solutions to specific parameter 
settings appears to be much more commonplace. 

Jaiswal [17] solves for the standard performance metrics when all arrival pro- 
cesses are Poisson and service time densities are general, except with infinite support 



2 Alternatively, all service time distributions are exponential. These memoryless assumptions lead 
to independence among idle and busy intervals. 
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and absolutely continuous with respect to Lebesgue measure. He considers different 
preemptive disciplines (e.g. resume, repeat) as well as non-preemptive service disci- 
plines. In all cases, when there are two traffic classes, solutions are found in terms of 
LaPlace transforms (much like for the M/G/l queue). When the service discipline is 
non-preemptive, FIFO within each class, and service and interarrival times are inde- 
pendent and exponential, the LaPlace transform for the waiting time distribution for 
each class is known [31). 

An application of Lemma C.2.1 allows us to snapshot system performance 
metrics only at the arrival times of a priority class with Poisson arrivals, from which 
system behavior can be inferred at all other time points, even when other (higher 
priority) traffic does not have Poisson arrivals. Note also that for each class with 
Poisson arrivals, the virtual waiting time distribution is the same as the steady state 
waiting time distribution. 

3.4 Heavy Traffic Approximations for Preemptive 
Fixed Priority Queues 

In this section we describe heavy traffic approximations for a preemptive resume fixed 
priority queue. In many ways, the development parallels the heavy traffic approxi- 
mation for a queue with a single traffic class so we focus primarily on defining the 
diffusion coefficients. These definitions are from [37]. In theory, this model describes 
our background aperiodic priority model. In practice, we will find that the model 
is often too optimistic (i.e. at system utilization p < 0.99). 3 The development here 

3 As p f 1, the HTM approximations will improve. However, in practice one would not choose 
p — 0.9999999 for a nominal value, for example. In Chapter 5 we propose a practical criterion which 
can be used to define conditions of "heavy traffic". Later in this Chapter we also consider transient 
overloads where p > 1 for short periods of time, but long term p < 1. 
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does not apply to our foreground aperiodic priority model, since task priority is 
dynamic. 

For fixed priority systems, only the lowest priority class is in heavy traffic. 
For all priority classes other than the lowest, it turns out that queue lengths (and 
hence waiting times) all converge to zero when using heavy traffic theory. For the 
lowest priority traffic class, Whitt [37] finds that both system size and traffic load 
(i.e. waiting work) processes converge weakly to Brownian motion with a reflecting 
lower barrier at zero. The steady state distribution of the processes are found using 
Equation C.3.1 in Appendix A. 

3.4.1 The System Size Process 

High priority (periodic) tasks are class 1 and the low priority (aperiodic) tasks are 
class 2. The appropriate drift coefficient for the number of aperiodic tasks in the 
system is m = (A2 - £^(1 - pi))- The notation is introduced in Table A.2. Define the 
n th system size random function by 

^(nt)-(A 2 -Ml-Pl)M = A n _ (1 _ fii)B n + (3.3) 
712 

Under suitable conditions of heavy traffic, weak convergence arguments (see [37] and 
[4]) give 

and 

for independent Weiner processes f^S^S and £*, for i G {1,2}. 
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By repeated application of Lemma C.3.2, we have 
and the sought after result 

712 

(3-4) 

where the £'s are independent Weiner processes, and 

4 = 4 = (1 - Pi) 2 a 6 2 >2 ^, and 4 = /^(Ai^ + ^Afra.i)- 

In our case, Xi ~ V(C),Ti ~ V(H),X 2 ~ and T 2 ~ £(A), we have 

a b,i = a l,i = 0, ^6,2 = Mf 2 ; cr£ 2 = Aj~ 2 . Equation 3.4 then becomes 

N 2 (nt)-(X 2 - Ml- Pl ))nt ^ N » ^ + ^ _ ^ 2) ^ (3 5) 

Equation 3.6 gives the heavy traffic approximation for the steady state distribution 
of aperiodic system size (i.e. of N$° = 7V 2 ). 

Notice when there is no periodic traffic, p\ = 0 and Equation 3.6 agrees with our 
results from the previous chapter. 
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3.4.2 The Virtual Waiting Time Process 

With preemptive priority systems, the virtual waiting time process at priorities other 
than the highest is not the same as the workload process since higher priority tasks 
can arrive in the future which causes increased waiting times for lower priority tasks. 
Recall also for a preemptive fixed priority scheduling discipline, when the total system 
is in a state of heavy traffic, only the lowest priority class will be in heavy traffic for 

p<i. 

We begin by looking at the workload process, from which an approximation 
for the waiting time can be obtained. The notation we introduce in this section is 
also included Table A.2. We explicitly include here some of the constructions we 
use when developing state variable process simulations for response time estimation 
under intermediate hyperperiods. Equation 3.7 defines the random functions used. 

Mt) =max{fc€{0,l,2 > ...}|^}. 1 Ty<0 fori 6 {1,2} 

Xi(t) =Zti?X id for t€ {1,2} 

Y l (t) = X l (t)-t 

Y 2 {t) =X 2 {t) + mf 0 < a < t Y l (s) (3.7) 

Ii(t) = -mfo< s < t Yi(s) for i e {1, 2} 

Wi(t) =y i (t)-mf 0 < ! < ( F j (3) fort €{1,2} 

X? = n-*[Xi(nt) - pint] for i e {1, 2}. 

Under suitable heavy traffic conditions, the sought result for aperiodic class 2 work- 
load is (see [37] and [4]) 

^>t)-(m + *-l)n U xi+ ^ (3.8) 
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where just as before, 

so Equation 3.8 becomes 

1 ~ (a x + a 2 )*£ 1 (3.9) 

when £ is a Weiner process and of = A<of ^ + /i^Ajoj^ for z e {1,2}. 

When m = pi+p2~l<0) the (class 2) workload process is positive recur- 
rent and a steady state distribution exists. Application of Equation C.3.1 gives the 
limiting probability for the virtual waiting time of a class 2 job which is shown in 
Equation 3.10. 

Pr(W 2 00 = W 2 <x) = l- e 2m2Xa ~* (3.10) 

For the specific case where A x ~ V(\ x l ), B\ ~ ©(/zf 1 ), A 2 ~ £ (X 2 ), and B 2 ~ £(^2), 
diffusion coefficients become a\ = 0, a 2 = (7^ + = 2A2/W2 2 , m = P — 1- 
Applying these values to Equation 3.10 gives the class 2 workload distribution shown 
in Equation 3.11. 

Pr(W 2 °° < x) = 1 - e-t 1 -')"^- 1 (3.11) 

Equation 3.11 describes the distribution of pending aperiodic work, which is 
not the virtual aperiodic waiting time since it does not include the amount of pending 
periodic work, and more importantly it does not take into account future arrivals with 
periodic work that will preempt pending aperiodic traffic and increase response times. 
Since the (time averaged) future of the periodic stream is known, the effects of current 
blocking times and future preemptions can be approximated. 
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Another application of the PASTA property (Lemma C.2.1) gives us that the 
pending aperiodic work distribution is that of the virtual waiting time distribution, 
in the absense of any future blocking by currently present or newly arriving peri- 
odic tasks. To compensate approximately for future blocking of aperiodics, pending 
aperiodic work is merely inflated by (1 - pi)~ l . This adjustment is shown in Equa- 
tion 3.12. In the next chapter we will see how well and under what conditions this 
approximation works. 

R(x) = 1 - e -(i-Mi-pi)9»M- 1 * (3.12) 

On departure from this chapter we note that there may be long but transient 
durations when m — m w (t) > 0, in which case the virtual waiting time process for 
the aperiodic traffic class, WjjWj can be approximated (in the long transient window) 
as Af(m w t y alt). 



Chapter 4 

On Performance Models 



This chapter introduces the degraded server model (DSM) and compares it to our 
previously studied heavy traffic (HT) models. We will make frequent use of the 
DS and HT models in subsequent chapters. First, we elaborate upon our analysis 
objectives. 

4,1 Analysis Objectives 

Since we are guaranteed that periodic tasks will meet their deadlines, our focus is 
on the performance of aperiodic tasks. Suppose each aperiodic task has a deadline 
that is constant relative to its arrival time. For example^ if the constant deadline 
is d, 1 then for each task r n arriving at time A n , r n 's (implied) deadline is at time 
A n + d. For a (within class) FIFO server model, a desirable objective is to have 
R(d) = P[R < d\ > 1 - e, for some small non-negative e, where R is the aperiodic 
task response time variable (or cdf). An alternative objective might be to minimize 
R(d). When d is arbitrary, these two objectives are essentially the the same. In 
both cases we seek to characterize the tails of the response time (or system size) 
distribution. 

For our estimates to be applicable in practice, we seek a lower bound on re- 
sponse times. In other words, if R e is our response time estimate and R 0 is our 
observed estimate, ideally Pv(R € < r) < Pr(R 0 < r) V r. We did not attempt to 

l hi future work, the possibility of each task having a randomly chosen (non-constant) deadline 
is considered. 
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provide a strict bounding estimate for response time distributions, but we focus in- 
spection on values of r that constitute the right tail of the distribution, since these 
are the critical values that define limits in a system's design. 

Much emphasis is also placed on the analysis of a system that is near saturation 
(i.e. when p = pi + P2 « 1) since a more lightly loaded system will perform no worse 
than a heavily loaded system (for a given sample path). In other words, the response 
times distribuion of a system with p = 0.99 will be stochastically larger than that 
of a system with p = 0.85 (i.e. R p =o.99(r) < Rp=o.$$(r) V r). We also consider the 
applicability of models when the system is not near saturation. It turns out the ratio 
of mean aperiodic service time to the hyperperiod has a predominant effect on the 
type of analysis used to describe the data when the C periodic time units are requested 
all at once. 2 The magnitude of H/p^ 1 = H&i ls a significant factor in determining 
the appropriateness of different modeling techniques. We later characterize ranges 
for A2 as a function of rY/i 2 (= H, for p$ — 1) and p\ = CH~ l . 

4.2 Degraded Server Models (DSMs) 

We did not find the phrase degraded server model in the referenced literature, but it is 
such a natural model adaptation, that its use in practice is probably not uncommon. 
The idea behind a degraded server model is to simply degrade the service rate by the 
proportion of time it is unavailable for service to a particular traffic class. 

For an aperiodic service rate of /x 2 , and a periodic utilization of pi, the degraded 
aperiodic service rate can be modeled by p = p*i(l — making the mean service 
time appear longer. In other words, assume B2 ~ £{p)- The degraded server response 

2 We do not permit the case of a total of C time units requested in many increments over a period 
of H time units. In particular, when n requests are made each for C/n time units, this is the same 
as defining W = H/n and C" = C/n. 
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time model is then the M/M/l response time model with arrival rate A = A2 and 
degraded service rate \x = ji. 

When considering the aperiodic traffic in isolation, we have an M/M/l queue, 
for which the response time distribution is known and given by 

R(x) = P[R < x] = 1 - e - ( " 2 - A2 > = 1 - e~^ l - p2)x . 

When factoring in the periodic traffic, the degraded server model approximate re- 
sponse time distribution becomes 

R d (x) = P[R < x) = 1 - e - { *- X2)x = 1 - e^ 1 -"-^* - 1 - e-^-'K 

(4.1) 

Reference to the degraded server model for response times is reference to the response 
time cdf given in Equation 4.1. For background aperiodic service, we will find that 
the DSM provides a good estimator when the hyperperiod is short, but is overly 
optimistic when the hyperperiod is too long. For foreground aperiodic service, we 
will find that the DSM provides a good estimator when the hyperperiod is short, but 
is overly pessimistic when the hyperperiod is too long. 

One thing to notice about the degraded server model is that it does not preserve 
utilization in the sense that X/jl ^ pi + p2 = p. For example, when p = 0.95, 
p = \/jiz= 0.80, 0.90 and 0.93, respectively for A = 0.2, 0.45 and 0.70. 

The same degraded server substitution also gives an approximating model for 
the queue length distribution which is shown in Equation 4,2. 

N d {x) = P[N 2 < n] = 1 - p< n+1 > = 1 - ( gg )("+*), (4.2) 

The degraded server system size model is a good estimator of the number of aperiodics 
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in the system under the same conditions that the degraded server response time model 
is also a good estimator for response times when using FIFO within the aperiodic task 
stream. 

4.3 Heavy Traffic vs Degraded Server Models 

In the previous chapter, we developed a response time estimate approximation for 
background aperiodic service under conditions of heavy traffic. Equation 4.3 repeats 
that estimate. 

R h (x) = 1 - c-P-pX 1 -")^)- 1 *. (4.3) 

In the development of Equation 4.3, a degraded server assumption was made to 
account for currently present and future periodic tasks. We wish to compare Equar 
tion 4.3 with the degraded server response time distribution Rd{x) defined by Equa- 
tion 4.1. 

R h given in Equation 4.3 was derived under the conditions that p « 1, and 
otherwise cannot be expected to be reasonable. However, when applicable, heavy 
traffic approximations yield a nice closed form approximation for even general aperi- 
odic arrivals and general aperiodic service disciplines, when an exact solution is often 
intractable. 

In contrast, R d in Equation 4.1 can be expected to apply at all traffic loadings 
when the periodic compute time is suitably small compared to a typical aperiodic 
compute time. Later simulation results show that the DSM is much less optimistic 
than the HTM for moderately utilized systems (e.g. p = 0.85), and in fact provides 
reasonable estimates for much larger values of C provided the aperiodic queue length 
is long enough that most response times span multiple hyperperiods. 
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In this section we restrict our comparison to mostly utilized (e.g. systems with 
p > 0.95). Let p=/>i+p2==l~e and suppose there is enough aperiodic traffic 
and total traffic so that 0 < e < p2 holds. When comparing R d with R hi observe 
the range 1 < (1 - pi)p^ 1 = 1 + ep^ 1 < 2, since this is the factor in the exponential 
exponent that distinguishes the two response time distributions. Hence, we see that 
Rd(x) < Rn(x) for all x > 0. Since we are interested in finding estimates that error 
on the side of conservatism, we (somewhat arbitrarily) choose the DSM, when either 
the DSM or HTM are reasonable estimators. 

Let Ph = /x 2 (l — p)(l — P\){p2)~ x and fa = /x 2 (l — p) be the heavy traffic 
and degraded server response time distribution parameters, respectively. Similarly, 
let £0.99^ ^o.99,d be the 99 </l percentile of the HT and DS response time distri- 
butions, respectively. Table 4.1 lists values of these variables under different system 
and aperiodic utilizations when ^ 2 = L The relative error, |x 0 .99,a - #o.99,<* I /xo.994 is 
also included. It increases as p decreases. 



p=l-e 


P2 


l + c^)" 1 


Ph 


fid 


#0.99,h 


^0.99 ,d 


error 


0.99 


0.25 


1.040 


0.0104 


0.01 


442.80 


460.52 


0.038 


0.95 




1.200 


0.0600 


0.05 


76.75 


92.10 


0.167 


0.90 




1.400 


0.1400 


0.10 


32.89 


46.05 


0.286 


0.99 


0.75 


1.013 


0.0101 


0.01 


455.96 


460.52 


0.010 


0.95 




1.067 


0.0533 


0.05 


86.40 


92.10 


0.062 


0.90 




1.133 


0.1133 


0.10 


40.65 


46.05 


0.117 



Table 4.1: Coefficients for HT and DS Models 



4.4 Sampling Techniques 

We used two instantiations (i.e. streams generated by two different seeds) of an im- 
plementation ([12]) of George Marsaglia's universal random number generator ([28]). 
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Each sequence of uniform random numbers was transformed to a sequence of expo- 
nential random numbers which define the aperiodic interarrival and service times. 
Two sequences were used (rather than one) in an effort to simulate the assumption 
of independence among interarrival and service time distributions. 

For each system configuration (i.e. (A, /x, C, H) and one of FGA or BGA) only 
a single simulation run was made. For a fixed (A, fi, C, H), the same seeds were used 
for both the FGA and BGA runs, providing a control for comparing response times 
between BGA and FGA service disciplines. Otherwise different seeds were used for 
each system configuration. 

Since the aperiodic arrival process is Poisson, sampling of aperiodic response 
times occurs at departures. (See Lemma 2.3.2 for a justification.) There is consider- 
able correlation in response times between adjacent arrivals. There can also be con- 
siderable correlation between adjacent hyperperiods. The response time sample size is 
1000. After an initial transient period, every 1673 rd task's response time is observed, 
where 1673 seemed large to avoid correlated observations (often by skipping samples 
from nearby hyperperiods) for short and moderate length hyperperiods. The average 
number of task response times sampled per hyperperiod is i//(1763/A) = (H A)/ 1763. 
For H = 8 and A = 0.24, this is roughly one response time every 900 hyperperiods. For 
the largest hyperperiod we considered this is less than (0.74)32768/1763 < 14 tasks 
sampled per (one long, H = 32, 768) hyperperiod. As we will see, large hyperperiods 
act as probabilistic replicas, so this spacing gives in excess of 70 "nearly-independent" 
hyperperiods sampled. For the foreground aperiodic scheduling discipline, after some 
initial transience a minimum of 1500 hyperperiods were observed to obtain estimates 
of parameters used to define the blocking time distribution. 

Before looking at our mixed periodic and aperiodic scheduling problems, we 
compare simulation response time EDFs of an M/M/l simulation with their (known) 
theoretical response time CDFs. This comparison provides a null case for differences 
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we might (reasonably) expect to see between predicted CDFs and observed EDFs for 
mixed scheduling response times. 

4.5 Response Time Plots 

For an M/M/l queue, the variances of both the response time and system size are 
proportional to (1 — p)" 2 . Consequently, when pwla great deal of variability can 
be expected among sample paths, many deviating potentially significantly from the 
theoretical CDF. The theoretical response time CDF, 1 - e~^~ A)x , might be thought 
of as the "average" of a number of EDFs. Figure 4.1 shows several response time 
plots for a purely aperiodic (M/M/l) system when p — 0.95 (the left hand side) and 
p = 0.99 (the right hand side). These are examples of the null case, and suggest an 
amount of variability we might reasonably expect to see when comparing empirical 
response time distributions from our mixed periodic and aperiodic scheduling policies 
against our predicted response time distributions. Based on visual inspection, all 
three of the response time curves when p = 0.95 appear reasonably close, where as 
only two of the three when p = 0.99 appear to fit well. However, comparing CDFs 
does not emphasize differences in the right tails of the distribution. So this is just 
one type of comparsion we will make. 

4.6 Q-Q Plots 

In the previous section, our means of evaluating model fit was via visual of inspection 
empirical response time CDFs overlayed on the model select ion(s). In this section we 
describe Q-Q plots and illustrate their use as a means to better compare the right 
tails of the distributions to the predicted models in the null cases presented above. 
Let R be a theoretical response time distribution, and let E be an empirical 



CHAPTER 4. ON PERFORMANCE MODELS 



49 




MM COF -.; £nv COFi far j FFO MAM Qieue 




0 20 40 CO 60 100 120 

M*»-Sm». mi - 1.000; lantxto - 0.000 



UM COF -.;&np COFi a FFO Uii/1 Queue 




0 100 200 300 400 900 W0 

i-ob-Iim; fnj ■ 1 000; ftrbi* « 0 000 



MWCCF-.;&npC0F»_..;tof«FfOUAV1<>teue MM COF -.; Eay COFt far »FtfO tJAtft Queue 




Figure 4.1: M/M/l Response Time EDFs 
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response time distribution. A Q-Q plot has as its y-axis, E~ l (p) for 0 < p < 1 and 
i?~*(p) for the x-axis. Ideally, a Q-Q plot will produce the line y = x. Observed 
differences in the right tails will be larger than observed differences in the left tails, 
since the response time CDF domain values increase as p t 1. Since we want the 
empirical CDF to be stochastically larger than the predicted CDF, in the case of a 
mismatch, the preference is for the Q-Q plot output to fall below (i.e. to the right 
of) the line y = x. 

The Q-Q plots corresponding to the response time curves in Figure 4.1 are 
shown in Figure 4.2. As expected, the largest differences tend to occur in the right 
tails. The inverse of the theoretical M/M/l response time curve involves taking the 
log of a number near zero, giving rise to an infinite slope at p = 1.0. 

The asterisks are placed at values for which p = 0.95, p = 0.99 and p = 1.0. 
For p = 0.95 (the left hand side), the agreement is fairly good to the 95 th percentile. 
Between the 95 th and 99 th percentile, the agreement is still moderately good, and 
typically there is one or more noticable points of departure between the 99 th and 
100 th percentile. In all three cases the maximum relative error appears to be between 
0.20 and 0.25. For p = 0.99 (the right hand side), one of the samples Is in much 
closer agreement than all three of the samples when p = 0.95. In the remaining 
two samples, divergence away from y = x begins well before the 95 th percentile. In 
these two cases, the maximum relative error again appears to be between 0.20 and 
0.25, These sample deviations for the null case provide a basis for what deviations 
we might see when approximating response time distributions for our mixed periodic 
and aperiodic scheduling problems. 



Chapter 5 

Mixed Scheduling: Background 
Aperiodics 

In this chapter, we study the behavior of the background aperiodic service dis- 
cipline. For this discipline, priorities are fixed, with priority 1 (highest) belonging to 
periodic tasks and priority 2 (lowest) belonging to aperiodic tasks. We will attempt 
to characterize both the steady state response time and system size distributions. 
Most of our investigations will focus on heavily loaded systems, since it is peak traffic 
periods (which might also correspond to long periods of transient overload) that are 
of concern in real-time systems. We are also primarily interested in (conservative) 
estimation of the right tails of the distributions. 

5.1 System Specification 

Given the scheduling discipline, there are four parameters that define the system 
specification: 

1. the periodic task stream's interarrival rate or equivalently, the time between 
periodic arrrivals (H = A]" 1 ), 

2. the periodic compute time (C = Pi 1 ), 1 

X E stands for Hyperperiod. When only a single periodic task is present, H is its period. When 
there are multiple periodic streams present (discussed in Chapter 7), H is defined to be the least 
common multiple of the periods of all periodic tasks. Similarly, C stands for task Compute time. 
This notation has become standard in real-time scheduling theory for periodic tasks and we will 
often make use of these mnemonics. 
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3. the aperiodic task stream's interarrival rate A = A 2 and 

4. the aperiodic service rate // = p>i. Unless otherwise stated, p, 2 = 1 in the 
reported simulation results. Problems with different values of ^2 can be scaled. 

In turn, these parameters define: 

1. the periodic utilization (pi = = C/H), 

2. the aperiodic utilization (pz = A2/M2 = A//z) and 

3. the system utilization (p = pi + P2). Under conditions of heavy traffic, the 
system is near saturation (i.e. p « 1). We mostly consider values of p = 0.99 
and p = 0.95 since heavy traffic conditions provide bounding curves for lighter 
traffic conditions. 

5.2 Aperiodic Response Time Analysis 

We will find that different models provide better estimators for different parameter 
specifications. In particular, for fixed M2>P2, and p, the models change as H (and 
hence C) increase. When comparing the foreground and background aperiodic service 
disciplines, it is helpful to introduce the concept of a periodic blocking time. The 
periodic blocking time is defined as the time a periodic task executes while one or 
more aperiodic tasks waits. We let B denote the average blocking time. 

Under conditions of heavy traffic, for the foreground aperiodic discipline, B « 
C. When /zj 1 ^> H, a single aperiodic task will likely experience more than a single 
blocking time during its completion time. In this case, the degraded server model 
is (intuitively) reasonable, and in fact provides fairly good estimates. Under these 
conditions, the parameters define a short hyperperiod model (SHM). 
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When fa 1 « C < ff, then only a small percentage of aperiodic tasks will 
experience a blocking time as a part of their completion time. For C sufficiently 
large, using only first moments, a good fluid flow estimator can be derived. As A 2 
(and p) decrease, this estimator remains valid, and C can also be decreased. Under 
these conditions, the parameters define a long hyperperiod model. 

The remaining case, the intermediate hyperperiod model) poses the greatest 
analytic challenges. In this case, we will see that response times fall in distinct bands 
which can be used to define response time ranges. For the intermediate hyperperiod 
model, we construct a piecewise linear CDF with response time values determined 
by band ranges, and CDF values determined by the observed probability of being in 
each band. 

5,2,1 Short Hyperperiods 

When 1 > H, a typical aperiodic service time will span multiple hyperperiods 
during each of which a blocking time equal to C will occur, except possibly for the 
first hyperperiod execution (when an aperiodic arrives to an empty aperiodic queue 
during a [0,C] time interval). Under these conditions, the degraded server model 
(DSM, see Section 4.2) was observed to work well under a range of traffic loading. 

When fi2 l is not an order of magnitude or more larger than if, but when the 
system is sufficiently saturated {e.g. p > 0.99), aperiodic queue lengths will tend to 
be long, so a response time for a newly arriving task will also tend to span several 
hyperperiods, even though its execution time is blocked by at most one (and probably 
zero) periodic task execution(s). This observation is formalized in Conjecture 5.2.1. 
Since the DSM and HTMs are close to one another, the criterion in Conjecture 5.2.1 
might be viewed as providing a practical rule of thumb for determining conditions of 
heavy traffic. 
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Conjecture 5.2.1 (BGA SHM Conditions) For background aperiodic scheduling , 
when the expected number of aperiodic arrivals during an average (DSM) response 
time is greater than or equal to the expected number of aperiodics discharged in a 
hyperperiod, the DSM provides reasonable estimates. 
More concisely, the DSM is reasonable when 

X ^ sm = Ml - CH-*) - A 2 ) - {H ~ C) ^ 
or equivalently, when 

A2 - H~i + ix 2 (l - Pl ) (Hut)" 1 + (1 - Pi) * 



Given the applicability of the DSM, the approximate theoretical mean and 
variance of the response times are those of the approximating models, given in obser- 
vation 5.2.2. 

Observation 5.2.2 (BGA SHM response time mean and variance) When 
the DSM provides reasonable estimates for the background aperiodic scheduling model, 
the approximate theoretical response time mean is 

m R = R = (/i 2 (l - pi) - A 2 ) _1 , 

and the approximate theoretical response time variance is 

°\ = Ml - Pi) - A 2 )~ 2 . 

This follows from the exponential DSM response time distribution, £(^2(1 — Pi) - A 2 ). 
Observe that for fixed pi, the response time mean and variance do not change with 
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H. Similarly, the DSM response time mean and variance do not change when both p 
and are fixed. 

Table 5.1 shows sets of C) values and minimum A values for which the 
DSM is a decent predictor. For fixed CH~ l , as H increases, so must A to ensure the 
queue lengths remain long much of the time. For an equilibrium distribution to exist, 
p < 1, so A2 = can not be increased without bound, and the DSM is reasonable 
only for H not too large. Table 5.1 contains some observed values for the mean and 
standard deviation of sample response times. 



H 


C 


^2,min 






16 


4 


> 0.693 


m T = 100 


o r = 100 


64 


16 


> 0.735 


theory 


theory 


128 


32 


> 0.742 






16 


12 


> 0.207 


for p = 0.99 


for p = 0.99 


64 


48 


> 0.236 






128 


96 


> 0.242 






8 


2 


0.74 


108.30 


101.49 


32 


8 


0.74 


91.20 


84.09 


128 


32 


0.74 


93.40 


80.70 


8 


6 


0.24 


97.07 


100.77 


32 


24 


0.24 


103.69 


93.93 


128 


96 


0.24 


127.25 


97.51 



Table 5.1: BGA SHM Response Time Sample Moments 



Figure 5.1 shows several empirical response time CDFs with a theoretical 
DSM overlay under a range of conditions which meet and then exceed the criteria in 
Conjecture 5.2.1. Figure 5.2 shows the corresponding Q-Q plots. When H becomes 
long, the DSM becomes much too optimistic. 

In practice, this case is not one of the greatest concern since it occurs infre- 
quently in hard real-time systems and the time an aperiodic is blocked by a periodic 
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can be less variable than the fluctuations in the aperiodic system size process. 
5.2.2 Long Hyperperiods 

When the hyperperiod is much longer than the mean aperiodic service time, we devise 
a long hyperperiod model (LHM) using only first moments to characterize the response 
time distribution of an aperiodic task. This model is applicable under both light and 
heavy traffic conditions, with an increase in H required for applicability as p increases. 
We also provide criteria for when the long hyperperiod model works reasonably well, 
and calculate the approximate mean and variance of aperiodic response times. 

In our long hyperperiod approximation, each hyperperiod is modeled as having 
a queue build-up (B) period, a discharge (D) period and then an M/M/l (M) period. 
In order for this approximation to be good, the hyperperiod must be long enough 
for the discharge to occur so at the end of the hyperperiod, the queue backlog is no 
longer present and the system is essentially in an M/M/l state. The buildup and 
discharge periods will be smaller for lighter aperiodic traffic, and this approximation 
is better at shorter hyperperiods when p 2 is small. This approximation is not good 
when the build-up and discharge periods extend across multiple hyperperiods (such 
as in the previous section). As we will see when p « 1, H needs to be rather long for 
this approximation to be reasonable. 

We now calculate our approximation for suitably long hyperperiods. Let t € 
[OjH] and suppose an arrival occurs at time t* Suppose further that at time i, the 
number of arrivals in that hyperperiod equals the expected number of arrivals in [0, t], 
or equivalently Xt, and that the number of departures in [0, t] is equal to the expected 
number of departures in [0, t] which varies as a function of t due to blocking by the 
periodic task. 

To calculate the expected departure time of an arrival at time t, partition the 
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hyperperiod in three intervals: 

1. At the start of the hyperperiod there are n 0 tasks. 2 In the blocking (B) interval, 
[0, B] all new aperiodic arrivals queue, since they have lower priority than the 
blocking periodic task. In the case of background aperiodic tasks, the left hand 
limit of B is always C. We use the notation B = C to more generally denote a 
(possibly random) blocking time (such as for the foreground aperiodic service 
discipline). 

2. Next is the discharge (D) interval defined on [£, F] where T is the first time, 
since the start of the hyperperiod, that the queue contains n 0 tasks (where n 0 
was the number of tasks at the start of the hyperperiod). Note also that T is 
a stopping time and E(,F) = {fxB)(^ — A)"" 1 . 

3. Last is the M/M/l (M) interval where the process has returned to an M/M/l 
queue beginning in state equal to no- All residual effects of the blocking period 
introduced by the period process are gone. M = [J 7 , H] and can be null when 
the length of B exceeds H-B. When M is routinely null, this approximation is 
poor. For suitably large H, approximating T by its expected value works well. 

Table 5.2 introduces the notation we use to develop the long hyperperiod 
model's response time analysis. When the long hyperperiod model is applicable, the 
stochastic behavior of each hyperperiod is similar, so it suffices to study the response 
time behavior in a single arbitrary hyperperiod. The notation in Table 5.2 is assumed 
to be with respect to some arbitrary hyperperiod, and not the beginning of time. The 
selection of values for n 0 and x Q are described in Section D.l.l of the appendix. 

To approximate the virtual response time of an (hypothesized) arrival at time 
t, the three regions are considered separately. Assume there are no aperiodic tasks 

2 A technique for choosing n 0 is described in Section D.l.l. 
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Notation 


Description 


n 0 

7 

B 
H 

Rb(0 

N B (t) 
Rd(s) 

N D (t) 
Rm{u) 


The number of aperiodic tasks in the system at time 0, the start of the hyperperiod. 
The expected value of an aperiodic response time when there are no aperiodic tasks 
in the system and no (present or new) periodic tasks, xq = no^J *• 
The first passage time to the state, k at time 0 (typically k = no) since departing 
state k after the start of the hyperperiod. T w E[T) = B + (\B)/(n - A) = 
G*B)/(/i- A) -*,)-». 

The compute time of the periodic task, t. B = C, to remind us that C is a blocking 
time and to allow for the possibility that B is a random variable. 
The hyperperiod, or equivalently Af 1 . 

The virtual response time, which is the response time assuming an arrival occurred 

at time t € B = [0, B). For long H, R B (t) « E[R B {t)]. 

The expected number in the system at time t 6 [0, B]. N B (t) = no + Xit- 

The virtual response time for an arrival at time 5 6D = [B,f]. The response time 

formulas differ based on arrival time. Rd(s) « E[R D ($)]. 

The expected number in the system at time t e [#, !F] = D. 

The virtual response time for an arrival at time u € M = [J*, if]. 



Table 5.2: Response Time Notation in Background Mode 

present at the start of the hyperperiod. The exact value of n 0 turns out not to be 
significant under conditions of heavy traffic. 

In the blocking region, the virtual response time is approximated by its mean 
shown in Equation 5.1. At time t, N B (t) = n 0 + A 2 <. No tasks begin discharging until 
time 5, which is B — t units away. Tasks then depart at an average rate of fX2- 

R B (t) « E(R B (t)) = (JB - *) + N B {t)& x = (B-t) + (no + A2t) (5.1) 

For s G [5,5] the process is discharging its queue backlog. At time s, the 
expected number of arrivals is n 0 + A2S- The expected number of departures is 
($ - B)iaz. So the expected number in the system at time s is E[N D (s)) = (no + 
A2S) - (s - B)n = (n 0 + X2B) - (s - 5)(^ 2 - A2). In the region [B> T], by hypothesis 
the queue never empties, so the expected virtual response time is Nd{s)^2 1 ^ shown 
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in Equation 5.2. Note that Rd(s) = Rb(s). 



Rd(s) « E(R D (t)) = B + 



(n 0 - (in - A 2 )s) 



= (B-s) + 



(re 0 + A 2 s) 

/X2 



(5.2) 



Lastly, for u € [T, H] the queue behaves as an M/M/l queue, and the expected 
virtual response time is simply the mean of the theoretical M/M/l response time 
distribution conditional on n 0 tasks in the queue which is G(v>2> n>o)> which has mean 



Since there is always a virtual task, the queue is never empty in the virtual response 
time analysis so the gamma distribution seems an appropriate approximation for 
response times. However, when system traffic is light, an observed queue might often 
empty in the M/M/l interval For an M/M/l queue, the (unconditional) steady 
state departure process has been shown to have distribution £(A 2 ) [35], which may 
be a better model for light system loadings. When T and n 0 are known, the virtual 
expected response time as a function of time can be graphed from Equations 5,1, 5.2 
and 5.3 and is shown in Figure 5.3. 

When the LHM works well, subsequent simulation plots will show that all 
response times will fall in the band defined by lines [(0, C), (C, 0)] and [(0, i/), (i/, 0)]. 

We now approximate J 7 . At time B 1 the expected number of aperiodic tasks 
queued is n 0 + A 2 B. An application of Equation 2.7 leads to Equation 5.4 as our 
approximation; 



R M (u) * E(R M (u)) = — =« 0 . 



(5.3) 



T » E\T\ = B + E[n 0 + X 2 B -> n 0 ] = B + (A 2 B) • E[B] = B + 



X 2 B B 



Va - A 2 (1 - to) ' 
(5.4) 
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arrival time 



Figure 5.3: BGA LHM: E[Response Time | Arrival Time ] vs Arrival Time 

Note that E{F) < H, and thus does not take into account the possibility of statistical 
fluctuations in the queue length process. Also the expected response time E(R) is 
maximum when the arrival is at time zero, and Eb(0) = B + r&o(/42 — A2)"" 1 . Both 
these observations emphasize that this model can only be a reasonable approximation 
when all arriving tasks depart within one hyperperiod and also suggests its use for 
light traffic. 

We formalize conditions when the long hyperperiod model is a reasonable 
approximation in Conjecture 5.2.3. 

Conjecture 5.2,3 (BGA LHM Conditions) For background aperiodic scheduling, 
when the M/M/l subinterval of the hyperperiod exceeds some multiple of the response 
time standard deviation in the degraded server model, the long hyperperiod (LHM) 
model provides reasonable estimates. 

More concisely, the LHM is reasonable when 



Xq + kp l (T rj fi gm < H — J- 



(5.5) 
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for some constant k Pl =k/pi > 0, or equivalents, when 

Xo + (MiX)-x i ) <H ~{£^MY (5 - 6) 

// we view x$ as a constant (when in fact it depends on Xi)> aT ^d let 
A= 1.0-x 0 H-\ 

B — 2n 2 (p l -l) + k Pi H- 1 -t-iteXoH-^-p!), and (5.7) 
C = (Ml - Pl )) 2 - (k^H- 1 ) - (x Ql 4H- 1 )(l - Pi), 

then the criterion reduces to 

(5.8) 



Equation 5.8 defines a criterion for when the LHM can be expected to give reasonable 
estimates for applications. To understand the intuition behind Equation 5.5, refer to 
Figure 5.3. Roughly speaking, the condition of Equation 5.5 can be viewed as an 
approximate confidence interval. Let I m be a random variable defining the length of 
the M/M/l interval (technically, I m = H - F). When in the discharge interval, we 
postulate that the queue length (and response time) behavior is either approximated 
by or no worse than what we would observe under the DSM conditions. At the start 
of the M/M/l interval, we pessimistically assume the response time variance for the 
first task to begin/resume service is cr£ dsm . This criterion is simply that the left hand 
response time interval limit does not exceed the M/M/l interval duration, H - T. 

When the condition in Equation 5.8 is not met, there are likely to be intervals 
(e.g. transient periods) where many arrivals will not depart within H time units of 
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their arrival time in which case the long hyperperiod model cannot be expected to 
hold. Observations suggest a value of k = 6 works well, with k Pl = /e(pi) -1 . 

Table 5.3 lists some sample ranges of A 2 for which the LHM is a reasonable 
estimate given the other system parameters, k = 6 has been chosen large enough that 
the A ranges appear slightly conservative thoughout all values of if. For A2 < A 2)max 
the fit is minimally good to the 99 th quantile or better. When « 1, the fit is good 
or better even when A 2 > A 2 , max . For example, consider H = 16384, and A 2 = 0.24, 
where the fit is exact. The quantile column lists the quantile at which the predicted 
and observed diverge noticably. The fit column list a qualitative assessment of the fit 
upto the quantile at which divergence occurs. 



H 


C 


Pmax 


A2 max 


approx. fit 


quantile 


A 2 


128 


32 


0.67 


< 0.42 


poor 




0.60 


256 


64 


0.79 


< 0.54 


moderate 


x(.95) 


0.60 


512 


128 


0.87 


< 0.62 


good 


x(.99) 


0.60 


512 


128 


0.87 


< 0.62 


moderate 


x(.95) 


0.70 


1024 


256 


0.91 


< 0.66 


good 


x(.97) 


0.70 


2048 


512 


0.94 


< 0.69 


very good 


x(.99) 


0.70 


4096 


1024 


0.96 


< 0.71 


moderate 


x(.95) 


0.74 


8192 


2048 


0.97 


< 0.72 


good 


x(.97) 


0.74 


16384 


4096 


0.98 


< 0.73 


very good 


x(.997) 


0.74 


128 


96 


0.75 


< 0.00 


good (in tails) 


x(.99) 


0.10 


256 


192 


0.83 


< 0.08 


very good 


x(.99) 


0.10 


512 


384 


0.88 


< 0.13 


excellent 


x(1.0) 


0.10 


512 


384 


0.88 


< 0.13 


very good 


x(.99) 


0.20 


1024 


786 


0.92 


< 0.17 


excellent 


x(1.0) 


0.20 


2048 


1536 


0.94 


< 0.19 


excellent 


x(1.0) 


0.20 


4096 


3072 


0.96 


< 0.21 


good 


x(.99) 


0.24 


8192 


6144 


0.97 


< 0.22 


excellent 


x(1.0) 


0.24 


16384 


12288 


0.98 


< 0.23 


exact 


x(1.0) 


0.24 



Table 5.3: BGA LHM Response Time Criterion Evaluation 
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A CDF is easily constructed from Figure 5.3 and is shown in Figure 5.4. 
In Figure 5.3, the slope between [0,B + x 0 ] and [B,x 0 + foB] is (p2 — 1)- This 
turns out to be the same slope as between points [B,zo + foB] and [T.x^Y * n 
the simplest construction of the long hyperperiod response time CDF, there is a 
point mass at x 0 with measure 1 - FH~ l . The jump height can be viewed as the 
probability an aperiodic arrival occurs in the M/M/l interval. It can also be thought 
of as the probability that a randomly chosen aperiodic task experiences no blocking 
delays, either directly or from residual backlog. Then there is a (straight) line from 
[J°, 1 — TH~ 1 } to [B + x 0 , 1.0]. The slope between x-coordinates x 0 and B + x 0 is 
[(1 - p2)H}~ 1 , which rises slowly when H is large. An expression for the CDF shown 




X 0 Xq+ B response time 

Figure 5.4: BGA LHM Predicted Response Time Distribution 

in Figure 5.4 is given in Equation 5.9. Note that the quantity 1 — (x — Xq)B~ 1 can 
be interpreted as Pr(blocking time a newly arriving task experiences > x — xq) for 
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x e [xo, B + Xq]. 



Ri(x) 



0 for x < xo 

1 - (i^j for x = xo 

l-Cl^a-^) for*o<z<-B + s 0 

1 for x > B + xq 



(5.9) 



Using the CDF shown in Figure 5.4, the distributional moments are easily 
computed. Conjecture 5.2.4 lists the response time mean and variance. 

Conjecture 5.2.4 (BGA LHM response time mean and variance) When the 
LHM is reasonable, the mean response time is given by 



E[R] = xq + 



and the variance is given by 



2(1-^) 2(1 -p 2 ) 



for B » xo, 



(5.10) 



Var[jR] = Pf [l _ _ £i ]. 
1 J ^ 2 (l-p2) l 3 4(1- P2) J 



(5.11) 



Both these calculations assume a point mass of I- TH 1 = 1 - pi(l - prj 1 at point 



Let G(t) = P(an arrival occurs in [0,t] \ an arrival occurs in [0, f/]). Then, by the 
assumption of Poisson arrivals, G(t) — tH~ x and g(t) — dG(t)/dt = if" 1 . The mean 
can also be computed as follows 



E[R] = /* [arrival at t]g{t)dt 

== H-'[tf((B -s) + [n 0 + X 2 sW)ds + // x 0 ds] 
= x 0 + |(p 1 B)(l^p 2 )- 1 . 



(5.12) 
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Table 5.4 shows serveral examples of theoretical and observed means and standard 
deviations. The observed values are based on a single simulation run at the respective 
parameter setting. 



H 
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r 
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a T 


d T 


f"max 


2048 


512 
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153 
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4096 
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1375 
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2048 




1024 


984 


625 
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2400 


2048 


1536 


0.24 


842 


758 


466 


449 


1600 


4096 


3072 




1540 


1516 


898 


898 


3100 


8192 


6144 




3137 


3032 


1824 


1797 


6200 



Table 5.4: BGA LHM Sample Moments 



The LHM is applicable when system traffic is not "heavy" . Rather than assign 
a point mass at Xq = rc 0 /i^\ it makes sense to define x 0 to be such that response times 
less than xq are defined by the M/M/l response time curve, and for those greater than 
no^2 1 the response time curve is defined by the long hyperperiod model curve. This 
not only eliminates any point masses in the response time curve, but it also increases 
xo with decreasing p2 (and fixed pi) corresponding to an increase in the M/M/l 
interval (in contrast to decreasing N 2 with decreasing z^), as one would expect. This 
adaptation is described in Section D.l.L 

The predictions given by the CDF in Figure 5.4 are optimistic near the bound- 
aries of LHM conditions in Equation 5.8 in which case a model for intermediate hy- 
perperiods might provide better response time estimates. The corresponsding Q-Q 
plots are shown in Figure 5.6. 
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Figure 5.5: BGA LHM Response Time EDFs 
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Figure 5.6: BGA LHM Response Time Q-Q Plots 
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5.2,3 Intermediate Hyper periods 

When neither the SHM nor the LHM are applicable or their application is borderline, 
we make use of an intermediate hyperperiod model (IHM) (also called a piecewise 
linear model, PWLM) which collects statistics from various regions of the response 
time bands. First we give the conditions for which the IHM is reasonable. We then 
formalize the notion of a response time band structure and illustrate it with several 
simulation results. 

We then identify response time values which tend to both coincide with no- 
ticeable slope changes in the response time curves. We call these points slope change 
(response time) values and denote them by V. These values can be related to the 
response time band structure, and consequently are a function only of the system 
parameters (i.e. of A 2 ,M2 = 1> H and C). 

The collection of slope change values along with the probability that a response 
time is less than these values can be used to construct a piecewise linear response time 
curve. Algorithms to efficiently collect simulation data from process state variables 
for the probabilities of being less than various critical values are constructed. Our 
proposed intermediate hyperperiod model lacks in mathematical eloquence, but is 
efficiently produced and can be rapidly evaluated against observed data. 

For the intermediate hyperperiod model, queue lengths grow sufficiently large 
that the queue discharge will often not occur in one hyperperiod, rendering the LHM 
inappropriate. Analogously, when the build-up and discharge queue lengths are suf- 
ficiently variable among different hyperperiods, the DSM is also inappropriate. This 
leads us to Conjecture 5.2.5 for the criteria of when the IHM applies. 

Conjecture 5.2.5 (BGA IHM Conditions) For the background aperiodic schedul- 
ing model, the IHM is a reasonable estimator of response times when X? satisfies 
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Equation 5.13. 

- pl) ~ th ~ 2 V W + ~~h— - Aa - (ff^-i + (i - Pl y 

(5.13) 

Conjecture 5 A3 is an immediate consequence of Conjecture 5.2 A and Equation 5.8 
of Conjecture 5.2.8 (assuming these conjectures are both valid). 

Given an arrival time of t mod H £ [0,if], the set of possible response times 
is defined by a set of bands as shown in Figure 5.7. The empty bands are the result 
of periodic blocking, where no aperiodic tasks can depart. The non-empty bands are 
called the response time bands. The width of each response time band is H - C, 
consequently the bands on the right hand side of Figure 5.7 (where C = are 
much narrower than the bands on the left hand side of Figure 5.7 (where C = \H). 
The k** 1 band is labeled B^- The intermediate hyperperiod model is applicable when 
response times fall in at least two bands and both bands contain at least a few percent 
(e.g. > 3%) and the DSM model does not apply. 3 Several observations can be made 
from Figure 5.7. First, the response time ranges where bands overlap are defined by 
response time values in ranges [kH + C>(k + 1)H] for k € {0, 1, 2, ...}. There is no 
overlap in the ranges [kH, kH + C] for k € {0, 1, 2, ...}. 

Intuitively, when H — C is small (relative to if), the regions of overlap among 
any two bands is also small (relative to the area of each band). Let Ak be the area 
of B k . Then A x = \{H + C)(H - C) and A k = H(H - C) for k €, {2, 3, 4, ...}. The 
area of the overlap region between adjacent bands is A Q = \{H — C) 2 . Let R be a 
randomly chosen response time. If the p k = P[R 6 B k ] were known, a conservative 
response time CDF can be constructed using points (0,0) and {(kH + C, /?£)}, where 



3 In fact, the PWLM works well when estimating response times that are well approximated by 
the DSM model. 
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Figure 5 J: BGA IHM Response Time Bands 

P*k — X^LiPj- When p 1} the periodic utilization, is fairly large, this estimator can be 
reasonable. Other times, a band's overlap regions cannot be attributed solely to the 
lower portion of the band, and the overlap regions must be accounted individually. 

Figure 5.8 shows response time plots as a function of arrival time (within a 
hyperperiod). It is easy to see the distribution of the data within a hyperperiod is not 
uniform along the vertical (response time) axis. With the exception of B u the density 
of the samples decreases with an increase in response time. And this density appears 
approximately uniform across the hyperperiod. In B\ there is additional structure 
which we defined in the LHM. 

We summarize how the PWLM is constructed: 



1. From the response time bands, define the set V of slope change values for 
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Figure 5.8: BGA IHM Response Time Data Bands 
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response times. Within Bu changes occur at 0,:ro,C, and H. 

C is the smallest response time that is resident in both B\ and B 2 and H is the 
largest response time that can be resident in both B\ and B 2 . 

For each B k , include points (k — 1)H + C and kH. Also include an observed 
point v max , which is found when we collect values for p* k = P[R < Vk]. 

Finally, V = (0, Vq,Vx, ...,V2k-uV2k> —j^max), where v 0 = x 0 , and for k > 1, 
t? 2 fc-i = min((& — 1)// + C,^ ma x) and V2* = min(fci/,u max ). For example, V 
might be (0, z 0 , C, H,H+C, 2H, 2H+C, 3#, tw) where 3i/ < v mdx <3H+C. 

Let / be the index of v max . Vector V now contains 1 + 2 points. 

2. Next define a set of probability values P = (0,po,pi, ... 5 Pj) where p^ = -Pfaj-i < 
J2 < v,-]. Generally the Pj are observed, po and pi are special cases, where 
Pi = P[R < C] and p 0 = (1 - FH~ l )p v P* = (0,pS,p|, ...,p? = 1.0), where 
p$ = po = (1 - ?H- l )px and pj = p x - p 0 . Then p£ = J** Pi = W < »*]■ 

3. Apply populated band clustering to the bands with the longest response 
times. If two or fewer percent of the response times lie in a single band, group 
them with the next lower band. Repeat this process until the band with the 
largest response times contains at least 3% of the data. 

4. If fewer than 98% of the response times are less than C + xo (which is the 
LHM), apply exponential smoothing to the band containing the largest response 
times. Exponential smoothing applies except when there is a match with LHM 
to quantile x(.98). 

In the band for which exponential smoothing applies, the response time distri- 
bution characteristics are defined in terms of partial (v$ full) blocking times. 
The distribution of blocking times will be covered in Chapter 6. 
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An algorithm for exponential smoothing follows. Let k satisfy fc€{l,...,/ + l} 
and pj < 0.02 for each j > k. Let p* = p* k and let x* be the DSM inverse of p*, 
so x* = -log(l - p*)(ji - A 2 )~ x . Finally, let Xq = v^-i and = xl ~ x * ^ 0* 
For x € [vjb-i, Vmax], define Ri{x) = 1 - i n practice, pick some 

number of evenly spaced v- values for which to compute probabilities. 

5. Plot the points defining the PWLM using V as the x-coordinates, and P* as the 
y-coordinates. Construct line segments between adjacent points so the PWLM 
response time EDF is (right) continuous. 

The data used by the LPWM can be obtained from system simulation, when 
one exists. It is often easier and faster to construct a state variable process simu- 
lation to obtain data points. A state variable simulation algorithm is developed in 
Section 5.2.4. 

Figure 5.9 shows examples of several PWLM response time CDFs overlayed 
on system simulation EDFs. Figure 5.10 shows corresponding the corresponding 
response time Q-Q plots. 

When system simulation data is used, all of the slope change points will fall on 
the observed empirical distribution function prior to jcJ, where exponential smoothing 
begins. Generally the IHM fits reasonably well. It reduces to the LHM when exponen- 
tial smoothing is not applied, and it is not difficult to see that when there are many 
bands, it approximates the DSM fairly well. When there are three response time 
bands, the estimate in the middle band is often conservative, since partial blocking 
is common within both the second and third bands. 

5.2,4 Response Time Variable Process Model 

In this section we define the response time behavior in terms of the state variables 
used to characterize the response time process. The state variable process model is 
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Figure 5.9: BGA IHM Response Time EDFs 
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Figure 5.10: BGA IHM Response Time Q-Q Plots 
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an exact specification of the response time process, although we know of no solutions 
or non-numeric approximations for its behavior beyond the special cases we have 
presented. 

In this section define W a (t) be the amount of aperiodic work in the system 
for an arrival at t € [0, H] where W a (t) includes the workload contribution for the 
newly arriving task. In other words, let = = the arrival time of the k ih 

aperiodic to arrive to the system (since time 0). Assume t 0 = 0 = W o (0), then 

W a (t k ) = max(0, W a (tk-i) ~ (<* - fc-i)) + X**. 

Define R a {t> W a (t)) = Ra(t) to be the response time of an arrival given the 
arrival time is at time t = t mod H and the aperiodic work in the system just after 
the instant of arrival is W a (i). Refering to Figure 5.7, it is not difficult to see that 
for k € {1,2,3,...}, 

P[R a (t) < kH] = P[W a (t) < k(H - C)} for t e [0, HI (5.14) 

Boundaries at multiples of H are nice in that they do not depend on the time of an 
arrival (modulus H). This is not true for any other set of response time values but 
some simplifications result for other special cases. In particular, for k € {0, 1, 2, ...}, 
we wish to compute P[R a < kH + C). The set of response time values defined by 
{kH, kH + C for k = 0, 1, 2, ...} defines the upper and lower bounds on the points 
of intersection within the various response time bands. In other words, the range of 
Bk = [max(0, (k - 2) if + C), kH] and the range of the intersection of Bk n J9*+i = 
\(k - l)H + C y kH] and for k > 1, the region of Bk that does not overlap with 
either B^-i or Bk+x is [(k — 1)H, (k — 1)H + C]. This partioning captures the band 
interactions in which response times distributions as a function of time appear similar. 
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Let m = min{C, H -C) and let M = max{C, H - C}. Then Equation 5.15 
defines the probability that a newly arriving task at time t mod H falls below kH+C 
for k e {0,1,2,...}. 



P[R Q (t)<kH + C} = < 



P[W a (t}<k(H -C) + t] forr€[0,ro] 
P[W a (t) < k(H -C) + m] for t € [m, M] 

P[W a {t)<k{H-C) + {t-m)\, te[M,H] (5.15) 



Equation 5.14 gives us the relationship 



so for 



W (t) 

k = [777^7^1 we have (/c - 1)H < « fl < fcif. 



Equation 5.15 is then used to decide if (k - 1)H < R a (t) < (k - \)H + C or {k- 
l)H C < R a (t) < KH. The index for Vj is j = 2k — 1 in the former case and 
j = 2k in the latter case. Figure D.l (Appendix D) summarizes in pseudo code the 
algorithm just described to determine the vectors V and P* as defined in Step 2 of the 
Algorithm in Section 5.2.3. The algorithm in Figure D.l is called the state variable 
process simulation algorithm for BGA IHM estimation. 



5.3 Aperiodic System Size Analysis 

In this section we consider models for system size, since system size is invariant with 
respect to the service discipline within a class. The system size analysis proceeds 
essentially analogously to the response time analysis for short and long hyperperiods, 
so the presentation is brief. Theoretical means and variances are computed for the 
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long and short hyperperiod models, and then compared to the data. We leave the 
exploration of a BGA IHM of system size for future work. Last, we look at the 
aperiodic queue length at periodic process departure points and compare them to 
crude theoretical estimations in an attempt to find regions for which asymptotic 
analysis might work well. 

5.3.1 Short Hyperperiods 

Using arguments similar to those given in Section 5.2.1 a DSM is used to model 
aperiodic system size under conditions set forth in Conjecture 5.2.1. The distribution 
for the limiting system size discrete random variable is given in Equation 5.16. 

N(n) = P[N 2 < n] = 1 - p< n+1 > = 1 - (^—^l (5.16) 

1 - pi 

By visual inspection, Equation 5.16 tends to be more optimistic and does not fit 
the state data quite as well as the response time DSM model, but it is certainly 
a reasonable engineering model. We make no attempt to sharpen the boundary 
conditions to ensure the applicability of the DSM for system size. Unlike the response 
time analysis, there is a noticable difference between the DSM and the HTM for 
system size. In all observed cases, the HTM is more optimistic than the DSM. 4 

Given the applicability of the DSM, the approximate theoretical mean and 
variance of the response times are those of the approximating models, given in obser- 
vation 5.3.1. 

Observation 5.3.1 (BGA SHM system size mean and variance) In the back- 
ground aperiodic scheduling model, when the DSM provides reasonable estimates, the 

4 In the HTM, it turns out that N\ 0 so N and N 2 Equation 5.16. Since periodics do not 
queue, \Ni(t) - N(t)\ < 1 for all t. No adjustments have been made to the HTM approximations. 
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approximate theoretical system size mean is 

m N = N = p(l- p)-\ 
and the approximate system size variance is 

4 = p(i - pY\ 

where p = A2[a*2(1 - Pi)]"" 1 - The DSM model is also an M/M/l queue, with arrival 
rate A 2 and service rate fi ^ ^{l - p x ). The system size mean queue length and 
variance of an M/M/l queue are given in Table BA. 

Observe that for fixed pi, the state data mean and variance do not change with 
H. Unlike the response time DSM, the state mean and variance do change for fixed 
H and varying p x . 

Table 5.5 shows sets of (H, C) values and minimum A 2 values for which the 
DSM works. For fixed C7/~ l , as H increases, so must A 2 to ensure the queue lengths 
remain long much of the time. For an equilibrium distribution to exist, p < 1, so 
A2 = M2P2 can not be increased without bound, and the DSM is reasonable only for H 
not too large. In Table 5.5 are observed values for the mean and standard deviation 
of sample aperiodic system size. There is better agreement for p\ = 0.75 and also less 
variability among observations. H = 128 is outside the DSM range for both values 
of pi. 

Figure 5.11 shows several system size EDFs with a theoretical DSM overlay 
under a range of conditions which meet the criterion in Conjecture 5.2.1. For p = 0.85 
and A2 = 0.1 (not shown) the predictions are fairly optimistic. At the boundaries 
with equal p, the fit is better for p\ = .25 than for pi = 0.75 which is not surprising 
since the aperiodic queue length size will be larger in the former case. 
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0.24 


23.49 
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Table 5.5: BGA SHM System Size Criterion Evaluation 
When H becomes long, the DSM becomes much too optimistic. 

5.3.2 Long Hyperperiods 

For long hyperperiods, the queue length buildup can be analyzed in much the same 
way as response time delays. First the hyperperiod is decomposed into three regions; 
a blocking interval B = [0, C = B], a discharge interval D = [C, where T is the 
first time the queue length returns to n 0 , its value at the beginning of the hyperperiod, 
and last an M/M/l interval M = [J 7 , H). Queue length as a function of hyperperiod 
is shown in Figure 5.12. For consistency with our value of n 0 in the response time 
distribution estimators, we take no = (1 - p2)~ 1 ln((l - /9 2 )" 1 /9i)- 

From Figure 5.12 it is easy to see that for n 6 (no, no + ^B], 

PrM < n) = 1 - = 1 - -r^ r[n - (n 0 + X 2 B)]H^ 

where t\ = (n — n 0 )A 2 1 and £ 2 = (w — (jU 2 J9 + n 0 ))(A 2 - which leads to an 
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Figure 5.11: BGA SHM System Size EDFs 
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Figure 5.12: BGA LHM: E[System Size | Arrival Time] vs Arrival Time 



approximating aperiodic system size CDF given in Equation 5.17. 



?t{Ni < n) = i 



1-f 



1- 
1 



for n < 0 
for n = n 0 

Hnin-xi) • («o + A 2# - n) for n € K no + A 2 £] 

for n > n 0 + A 2 5. 



(5.17) 



Equation 5,17 can be adapted in the obvious ways for values of n E [0, no], but under 
conditions of heavy traffic, these terms do not significantly change the shape of the 
distribution. The long hyperperiod model is also appropriate for moderately loaded 
systems. The CDF for aperiodic system size described by Equation 5.17 is graphed 
in Figure 5.13. 

The conditions for which the LHM appears a reasonable model for system size 
are characterized in Conjecture 5.2.3. Given applicability of the LHM, the approx- 
imate theoretical mean and variance of system size are those of the approximating 
model, given in observation 5.3.1. 

Observation 5.3,2 (BGA LHM system size mean and variance) In the back- 
ground aperiodic scheduling model, when the LHM provides reasonable estimates, the 
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Figure 5.13: BGA LHM Predicted System Size CDF 



approximate theoretical system size mean is 



m N = N = n Q + 



2(1-/02) 3 



and the approximate theoretical system size variance is 

Pi 



2 (W^i 



l-to l 3 4(1 -ft) 11 

6oi/i 0/ wAsc/a follow from standard calculations using Equation 5.17. These calcu- 
lations have assumed a point mass at no, and continuous (not discrete values) for 
n 6 (n 0) n 0 + \ 2 B]. Note that E[N] w A 2 £[#] and Var[iV] = A^Var[i?]. 

Table 5.6 shows several examples of theoretical and observed means and stan- 
dard deviations. The observed values are based on a single simulation run at the 
respective parameter setting. Even though the means and standard deviations are 
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Table 5.6; BGA LHM System Size Sample Moments 

fairly close, the LHM for system size is optimistic in the tails at the boundary con- 
ditions. As H increases, the optimism becomes reduced. Figure 5.14 overlays several 
empirical distribution functions with the system size estimators for long hyperperiod 
given in Equation 5.17. 

5*3.3 Aperiodic Queue Lengths at Periodic Departures 

The last topic we look at in this chapter is the distribution of the aperiodic system 
size at periodic task departure times. Since we don't make use of these results for 
our background analysis, we make some simple unrefined observations which are 
summarized in conjecture 5.3.3. However, the distribution of blocking times and the 
related distribution of aperiodic queue lengths at periodic departure times turns out 
to be helpful when studying the foreground aperiodic discipline. 

Table 5.7 lists the means and standard deviations, when p = 0.99 of aperiodic 
queue lengths at both periodic departure times (i.e. when t mod H = C) and aver- 
aged over all time. Let be the number of aperiodics in the system at the time of a 
periodic departure (just after a blocking period), and let and at denote the mean 
and standard deviation of N b , respectively. Several observations can be made. Not 
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Figure 5.14: BGA LHM System Size EDFs 
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surprisingly, <7& tends to be less than a n (the time averaged sample standard devia- 
tion). For X 2 = 0.74, a b is essentially constant and does not depend on hyperperiod 
length in the range considered. Also, when the DSM conditions hold, <3& « a n and 
rht « m n , suggesting that the aperiodic system size behavior at a particular point is 
representative of all points within a hyperperiod. When either the IHM or LHM hold, 
™>b > with the difference increasing with increasing if, again not surprisingly. 
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Table 5.7: BGA LHM System Size Moments at Departure Times 

Let N d be the average number of aperiodics in the DSM, so N d = A 2 [m2(1 - 
Pi) - Aa]" 1 - Define the heavy traffic estimate for the mean number in the system as 

N °l M2(l-Pl) 2 + A 2 

h 2m n 2(^(1 -,n)-A 2 )' 

where a\ and m n are the diffusion coefficients for the system size sketched in Sec- 
tion 3.4.1. 
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Conjecture 5,3.3 (Aperiodic Queue Length at Periodic Departure Times) 

In the background aperiodic scheduling model, under conditions of heavy traffic, the 
following models approximate the equilibrium distribution of the aperiodic queue length 
at the time of periodic departures. 

When the DSM is applicable, then 

Pr[N 2b < nj « 1 - /5 (n+1) , for n € {0, 1,2, ...} and p = . 

1 - Pi 

(5.18) 

One way to think about Equation 5.18 is that when the blocking time is sufficiently 
short, fluctuations in queue length are (statistically) indistinguishable at these points. 
The mean is given by 

P A 2 
l-p ti 2 (l -pi) - A 2 

and the variance is given by 

Note that the same consistent bias appears in a b that appeared in a n for the DSM (See 
Table 5.7). 

Under conditions of the LHM 

As H t oo, a b -> max(a d , V%C) and m b -> A 2 C. (5.19) 

When the quantity A 2 C is large, the accumulation over the blocking interval will pro- 
vide the dominant terms in the mean and variance. Note, these limiting values are 
only approximate for H = 8192 and A 2 = 0.24. Even for H = 8192, when C = \H the 
blocking times are not long enough to experience most of the queue length variability. 
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Based on visual inspection of the data, when in the IHM and LHM regions, 
if &l ^ ^2<? then it is plausible that N b ~ JV^C, A2C). When of « cr^, i/ien 
AT& ~ £(r, 5) w/iere r = N^ 1 and 8 = max(m — 0) w/iere m ^ A 2 C ususally 
larger). In the latter case, 

P(N b <x) = {l-e~ r ( x -V)[x>5}. 
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Figure 5.15: BGA: Aperiodic Queue Length EDFs at Periodic Departures 
A few EDF plots for aperiodic queue length at the time of periodic departures 
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under conditions of the LHM are shown in Figure 5.15 for p = 0.99 and p = 0.95. For 
fixed p } pi must be large enough for the asymptotics to apply. When p = 0.95, the fit 
is reasonable for both values of p\. When p = 0.99, the fit in the tails is not good for 
pi = 0.25. Our conjecture is obviously lacking since even first and second moments are 
not close for relatively long hyperperiods. It seems possible to more carefully derive 
estimates of aperiodic queue length distributions at periodic departure instants using 
heavy traffic theory techniques. However, we do not make use of these results, and we 
have seen cases where the theoretical estimates axe not particularly good in practice. 



Chapter 6 

Mixed Scheduling: Foreground 
Aperiodics 

Strictly speaking, the title of this chapter is a misnomer since the periodic task 
deadline requirement guarantees every periodic task C units of processor time 
within H units of its arrival. Periodic tasks will have foreground priority when nec- 
essary to meet its deadline requirement. If we define the slack to be the time to a 
periodic task's deadline minus the remainder of its computation time, then aperiodic 
tasks are only (high priority) foreground tasks when slack is available. When slack be- 
comes unavailable, their priorities drop to the background priority. So "foreground" 
actually refers to the possibility of an aperiodic task having foreground priority for 
some portion (possibly ail) of its system time. 

The remainder of this chapter is organized as follows. First, we provide an 
overview of the system specification while introducing a definition for the periodic 
blocking time, now a random variable. The periodic blocking time is then conceptu- 
ally equated to the execution time of a (quasi-)periodic, for which the system anaysis 
models are developed. 

The types of analyses are again subcategorized according to hyperperiod length. 
This is a simplified categorization, since factors other than hyperperiod length de- 
termine the applicability of the various modeling approaches. We again focus on 
estimation of the right tails of distributions. 
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6-1 System Specification 

The now familiar parameters H } C, A2 and ^2 are sufficient to fully define (but not 
necessarily analyze) the system behavior. 1 For the foreground aperiodic model, we 
find it useful to characterize new parameters to help analyze system behavior. In 
particular, we are interested in estimating the following quantities: 

1. the periodic blocking time distribution B (when attainable), 

2. the average blocking time J3, when an estimate of the distribution of B is not 
attainable, 

3. u>o, the probability that blocking does not occur in a hyperperiod, a> m , the 
probability that maximum blocking occurs within a hyperperiod (both when an 
estimate of the distribution of B is not attainable), and 

4. B b} the average blocking time conditional on some blocking occuring in a hy- 
perperiod, where B = uq • 0 + (1 - lvo) • B&. 

Given a new definition for periodic blocking utilization is pi = BH* 1 . A new 
definition for system utilization becomes p = pi + p2- Note that unless B & C 1 
the aperiodic traffic is often not in a state of heavy traffic. The periodic traffic is 
sometimes in a state of heavy traffic, but its deadlines are met by design. 

6,2 Blocking Time Analysis 

A primary difference between the analysis of the foreground and background aperiodic 
service disciplines is the blocking times in the former are described by a non-constant 
random variable. At the top of Figure 6.1 is a sequence of hyperperiods illustrating a 

1 These system parameters are defined in Section 5.1. 
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Figure 6.1: FGA Blocking Transformation Example 

possible blocking time sequence. Note that the blocking times (when non-zero) occur 
at the end of the hyperperiod, where aperiodic tasks are suspended to allow the 
periodic task to complete by its deadline. If we define those hyperperiods in which 
no blocking occurs to have a blocking time of zero, then the mean time between 
adjacent blocking times is H. The time between the end of adjacent blocking times is 
constant and also equal to H. For ease of analysis, we sometimes find it convenient to 
assume that periodic blocking occurs at the start of the hyperperiod (rather than at 
the end). The bottom of Figure 6.1 illustrates our proposed transformed aperiodic 
foreground model. 

The transformed foreground aperiodic scheduling model can now be studied 
as a background aperiodic scheduling model where the tasks with the periodic ar- 
rivals have random (non-constant) execution times defined only by the amount of 
time for which actual blocking of aperiodic tasks occurs. Assuming the transformed 
model is a reasonable approximation to the actual model, the blocking time distri- 
bution (which conceptually corresponds to the service time distribution for the "high 
priority" periodic tasks) must be specified. 
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In the remainder of this section we characterize some general observations 
about blocking times and propose a candidate model to describe blocking time distri- 
butions. Model refinements for different system configurations are described in their 
respective sections, when determinable. 

Let C(t) denote the amount of periodic execution time consumed in [0, t] for 
t G [0, H] and W^W denote the aperiodic work queued (or in progress) at time i. 
Define an aperiodic idle interval to be an interval during which there are no aperiodic 
tasks in the system. The duration of an aperiodic idle interval is measured from the 
first instant during which there are no aperiodic tasks in the system to the time of 
arrival of the next aperiodic task. Prior to any blocking within a hyperperiod, the 
accumulated execution of the periodic task, C(t) is described by the minimum of sum 
of the aperiodic idle intervals and C. 

Depending on the system configuration, one of three things is likely to happen 
in some percentage of the hyperperiods: 

1. No blocking occurs when for all £ G [0, H] the aperiodic workload does not 
exceed the remaining available time for aperiodics. Strictly speaking, once the 
periodic task completes no amount of aperiodic workload can cause blocking 
within the current hyperperiod (but certainly can in subsequent hyperperiods). 
In symbols, no blocking occurs if 3t e [C, H) such that C(t) = C. 

Let u>o = Pr[no blocking]. For H moderately large, u>o > 0. For H very large 
then a>o « 1, and the aperiodic response time distribution appears (statistically) 
similar to the idealized M/M/l response time distribution, 

2. Maximum blocking occurs when the system is constantly busy with aperiodic 
work in the interval [0,if — C]. In symbols, maximum blocking occurs when 
V s € [0, H — C], W^{s) > 0. In the case of maximum blocking, the periodic 
task executes in its entirety at the end of the hyperperiod. 
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Let Mm = Pr [maximum blocking]. Only for very short H with heavy traffic is 
u m as 1, in which case response times for foreground and background aperiodic 
services visually appear to be largely indistinguishable. For H moderately long, 

3. Partial blocking occurs when some portion of the execution time of the periodic 
tasks occurs in at least one aperiodic idle interval and completes in a blocking 
interval. During the former, no aperiodic tasks are in the system, during the 
latter at least one aperiodic task is queued. In symbols, 3 si,s 2 € [0,// - 
C) such that 0 < s x < s 2 < H - C, and V* e [$i,s 2 ], W 2 (s) = 0. Also, 3 t € 
[H - C, H) such that W 2 (t) > (H - t) - (C - C{t)) is satisfied. The interval 
in the first condition requires that the aperiodic idle interval have non-zero 
measure. 

Let Up = Pr[partial blocking]. 

Let B p (x) = Pr [partial blocking time is < x \ some blocking occurred] and let 
b p be the corresponding pdf for B p . 
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Equation 6.1 summarizes the blocking time notation. 



C{t) the accumulated period compute time at t 6 [0, H] 9 0 < C{t) <C 

W 2 (t) the amount of aperiodic work in the system at t € [0, H) 

n a (t) the number of aperiodic idle intervals in [0, t], t € [0, H] 

uq the probability of no blocking in a hyperperiod 

uj m the probability of maximum blocking in a hyperperiod 

u p the probability of partial blocking 

B p (x) Pr [blocking time < x \ partial blocking] 

b p (x) dB p {x)/dx 

B(x) Pr [blocking time < x] 

b(x) dB(x)/dx 



(6.1) 



Equation 6.2 describes B(x), the probability that the blocking time does not exceed 
x in an arbitrary hyperperiod. 



B(x) = { 



0 for x < 0 
ojq for x = 0 

ujq + u) v Bp{x) for 0 < x < C 

1 for x > C 



(6.2) 



Denote the associated pdf for B(x) by b(x) which is shown in Equation 6.3. 



cjo for x = 0 

u)pb p {x) for 0 < x < C 

uj m for x = C 

0 otherwise 



(6.3) 
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Note that 

B = o; 0 0 + uj p B p + u; m C = o; p JBp + <j m C. 

We currently collect c^o^m and often B p from data generated by the system simula- 
tion. 

6.2.1 Estimating wq = Pr[no blocking] 

In this section we give two mathematical expressions for asymptotic approximations 
of cjo) the probability that no blocking occurs in a hyperperiod. Unfortunately, neither 
approximation turns out to provide good estimates so we ultimately rely on simulation 
data for these parameter values. One shortcoming of our approximations is that they 
consider the behavior of a single hyperperiod and do not take into account the feed 
forward nature of aperiodic backlogs from hyperperiods with blocking. Nonetheless 
it is informative to observe the size and patterns of discrepancies between observed 
and predicted values. 

For the M/M/l (and M/G/l) queue, the distribution of each idle interval is 
£ (A2). Given the number of aperiodic idle intervals in [0,t] is n a (t), then C(t) w 
£h (A2,n 0 (*)), when C(t) < C. 2 Let t h be the first instant at which periodic blocking 
occurs in the hyperperiod. If no blocking occurs, then £ 6 = H. 

The definition of an aperiodic idle interval at the start and completion of hy- 
perperiods adds complications. For simplicity, if at the start (end) of the hyperperiod 
there are no aperiodics, we simply assume this begins (ends) an aperiodic idle inter- 
val, even though the actual beginning (ending) almost certainly started (ended) in 
the previous (next) hyperperiod. For very long hyperperiods, two fractional aperi- 
odic idle intervals is but a small percentage of the hundreds or thousands that occur. 

2 Q H is a gamma distribution conditioned on the value being less than H. The observed aperiodic 
idle time cannot exceed the observation period. 
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Using the CLT for renewal processes (see the example of Lemma C.l.l), for t < U 
with U large, as 1 1 td, 

n a (t) ~ 7^(*A 2 (1 - f>2) y tX 2 {l - 3p2 + 4p£)). (6.4) 

If the estimates of the number of aperiodic idle intervals is poor, then our Gamma 
approximation for the aperiodic idle time is also likely to be poor. One would not 
expect an asymptotic result to work well for short or even moderate H. 

Our first estimate for o,o is given in Equation 6.5. When we approximate v by 
a normal random variable and we do not integrate over domains where v < 0. 

"<U * £~ o Pr(<?A 2 ,, > C\G^ < H]Pv[n a (H) = u) 

s* J 0 °° Pr[G A2 ,, > C\G M , U < H]dPv[n a (H) < v). 

For long hyperperiods, we consider a second estimate of u, 0 . Let Ij be the length 
of the j th aperiodic idle interval. Then E[Ij] = A2 1 for each j and by the strong law 
of large numbers (SLLN), lim^^n -1 YTj=ih ^ ^2* w.p.l. Let n+ = max(0,n fl ) 
be an estimate of the number of aperiodic idle intervals in the hyperperiod where 
n a = n a (H) ~ Af(HX 2 (l - p 2 ), HX 2 (1 - Zp 2 + 4pl)). For H large, n+ ^ n 0 (with high 
probability). Our second estimate of u>o is given in Equation 6.6. 

u, 0 ,2 = u, 0 « Pr(n a (H) > X 2 C) = Pv(Z > -^Z^LzgM )t (6 . 6 ) 

[H\ 2 (l-Zp 2 + 4f$))2 

where Z ~ JV(0, 1). For short hyperperiods, when not using n+ Equation 6.6 is 
essentially assigning positive measure to cases with an approximated negative number 
of aperiodic idle intervals resulting in an erroneously high probability for no blocking. 
For t > the distribution of C(t) </> {?ff(A 2 , n a (t)). The complete description 
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Table 6.1: Two estimates and experimental data for o; 0 



of the distribution of C(t) is given in Equation 6.7. 



C(t) ~ < 



P(0) n a (t) = 0 and i < # - C 

Gh(^> n«(t)) for 0 < t < t b , n a {t) > 0 and C(t) < C 

Gh(*2, n a (t b )) + (r - t b ) for t b <t<H, n a (t) > 0 and C(t) < C 

(t-{H-C)) for n a (t) = 0 and t > H - C 

V{C) otherwise 
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When U <C H, the idle counts are not approximately normally distributed and the 
accumulated aperiodic idle times are not approximately gamma distributed which is 
likely one reason why our estimates are quite poor for short hyperperiods. 

However, Table 6.1 shows neither estimate of ujq is particularly good for large 
values of H either. Values in the column labelled o> 0 ,i were calculated using Equa- 
tion 6.5 and values in the column labelled u; 0) 2 were calculated using Equation 6.6. 
Only for long hyperperiods when pi = 0.25 (i.e. when most of the traffic is aperiodic), 
are the estimates even approximate to the data and to each other. Our estimates are 
pessimistic when p\ = 0.75 for the longer hyperperiods. 

Let u>o t pi be the value of loo at p\ (for some fixed p and H). In practice we 
consistently observed that for equal utilizations and all if, 0^1=0.75 < ^0^=0.25- 
In contrast, our predicted estimates consistently estimated u;o,pi=:o.25 < ^o,/>i=o.75- 
Despite this troublesome inconsistency, we strongly suspect the observed inequality 
generalizes. 

Informally, fix p and consider only the aperiodic traffic. Prior to periodic 
blocking, in the absense of any initial aperiodic backlog, E[C(t)\ = (1 — p<i)t. For 
s > Pi(l — P2)~ l H y E[s) = C. Now fix P2 and if, then 5 f as p f, suggesting increased 
probability of blocking at the end of the hyper period for heavily utilized servers. For 
fixed p and P2, s f as H f suggesting more aperiodic idle intervals per hyperperiod 
tend to occur, in turn suggesting that fewer aperiodic tasks will experience blocking. 
Lastly, for fixed H and p, as p2 t then s I suggesting that the periodic task tends 
to complete earlier in the hyperperiod. This in turn suggests the probability of 
blocking decreases with decreasing aperiodic traffic, which is again consistent with 
our observations. 

Since we were unsuccessful at finding an explicit expression, we instead rely on 
simulation data for values of ujq. We close this section with Conjecture 6.2.1, which 
we later use. 
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Conjecture 6.2.1 (As hyperperiods increase blocking decreases.) Under the 
foreground aperiodic scheduling model, when p and p\ are fixed, then wq 1 1 as H t oo. 

Using the informal reasoning in the previous paragraphs, in the absence of 
aperiodic backlog and of periodic blocking, the expected time for aperiodic idle time 
accumulation to (first) equal C is 

which clearly increases with H when p and p\ are fixed. Note, when H = oo, the FGA 
discipline reduces to the M/M/l queue and u 0 = 1 for all p < 1 . Also, all data and 
all our various attempts at calculating u>o increase monotonically with H for fixed p 
and p\. 

6-2.2 Estimating u m — Pr[max blocking] 

Maximum blocking occurs when the system never empties of aperiodic tasks in [0, if — 
C\. If at time 0, an aperiodic task arrived to an otherwise empty aperiodic queue, 
then u/ m would be the probability of a busy interval exceeding H—C. For the M/M/l 
queue, the density of the busy interval is given in Equation 6.8 ([35], pg. 33), 

B'{x) = iie^*[I 0 {2 f{\ti)x) - I 2 (2^{\p)x)} [x > 0], (6.8) 

where I r (x) is the modified Bessel function of order r defined by 

00 /„/n\ r _|_2j 



J fx) = V (x/2r 



(6.9) 



The integral, B(x) = f* B f (y) dy can be evaluated with the aid of a computer. How- 
ever u m ^ (1 - B(H - C)), since multiple aperiodics queued at the beginning of the 
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hyperperiod gives rise to multiple busy periods (calculated using convolutions of the 
density given in Equation 6.8) which becomes more complex to evaluate both ana- 
lytically and computationally. The analytic challenge is to describe the distribution 
of queued aperiodics at the end of the hyperperiod, which is related to the blocking 
distributions (since during blocking times, queue lengths only grow). We will later 
look at some sample aperiodic queue length distributions at periodic departure times. 

If we were to consider more general service time distributions for the aperiodic 
tasks, the busy period distribution has also been found for the M/G/l queue ([35], 
pg 58). It involves summing an infinite number of A>fold convolutions, where k is the 
index of summation. This gives rise to a problem of computation and approximation. 
So we rely on simulation data for values for u? m> just as we did for u; 0 . 

When the observed value of u m « 1, the foreground and background aperiodic 
scheduling models appear essentially the same. When the observed value of u} m > 0 
and u/ m 96 1, the blocking time distribution has a point mass of u m at the value C. 
Examples are given in subsequent sections. 

6.2-3 Estimating u/ p = Pr[partial blocking] and B p 

The relationship w 0 + oj p + oj m = 1 clearly holds, so if uj 0 and u m were computable, 
then so would be u) p . Several weighting distributions were observed among the a/ ? s. 

1. u) m « 1: In this case u; 0 « 0 and u p « 1 - u m . For simplicity, the blocking time 
distribution is assumed to be V{B) ^ V{C). 

2. (jj m « 0: When there is no maximum blocking, there may not be, but typically 
is some partial blocking. 

(a) When w p > 0, the blocking time distribution is reasonably approximated 
by an exponential with parameters determined from either heavy traffic 
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theory or the degraded server model. This will be explored in Section 6.6. 

(b) When ujq « 1, the blocking time distribution is relatively unimportant 
since it has so little weight. 

3. u) mi oj p ,ujQ > 0: When u m > 0, there are tasks with completion times that 
experience at least one full block, and possibly more. For those tasks, a degraded 
server model is appropriate. For tasks with completion times experiencing no 
blocking, the M/M/l model is appropriate. For this case we model task response 
times as a mixture of response time models for the M/M/l and DSM queues and 
find cases where the mixture model works well, but also also find cases where 
including the blocking time distribution (perhaps as a part of a completion time 
distribution) might lead to improved estimates which we defer to future work. 

6.2,4 Task vs Hyperperiod Blocking Probabilities 

The previous sections focused on the probability of no blocking in a hyperperiod which 
is different from the probability an individual aperiodic task experiences blocking. Let 
u> 0 be the probability that an aperiodic task experiences no delays due to blocking. 
An aperiodic task, a, is said to experience blocking delays when, while waiting or in 
service, there is some aperiodic task with a completion time that includes some portion 
of a periodic task's execution time. In other words, an aperiodic task's response time 
includes no blocking only if all of its waiting time is due to aperiodic processing and 
its completion time is equal to its execution time. 

When oj m = 0, we use the task blocking approximation given in Equation 6.10. 

Q 0 = u, 0 + (1 - c*)(l ~ F b H' x ) = 1-^ = 1- ^3^y- (6.10) 

Note the similarity to Equation 5.9 when x = x 0 which models the probability an 
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aperiodic task experiences no blocking in the BGA LHM. The assumption that oj m = 
0 says that tasks do not queue for multiple hyperperiods and by implication will 
typically depart within H time units of their arrival. Now, let 7 be the arrival time 
of an aperiodic task and consider the interval [7,7 + 7/]. By the PASTA property, an 
arbitrary aperiodic arrival will see time averages, and hence experience no blocking 
under two conditions. First the task might arrive to a hyperperiod interval [7, 7 + H] 
in which no blocking occurs which happens with probability u>o. Also by the PASTA 
property, the interval [7,7 + H] contains partial blocking with probability 1 - uq. 
In this case, the task must arrive and depart in the M/M/l part of the blocking 
hyperperiod, which happens with approximate probability (1— u;o)(l — F b H~ l ), where 
F b = faBbifa — A2)" 1 = B b (l - P2Y 1 is the expected first return time to the M/M/l 
state when in a hyperperiod with blocking. (See Section 5,2.2 for the derivation of 
F = B(l - p 2 y l and hence for F b .) Note the similarity of form in Equation 6.10 with 
the equation for u 0 given in Section 5.2.2. 

Since we have assumed uj m = 0, B b = B p . However, since our definition of pi 
does not depend on partial blocking (i.e. on u m = 0), we use Equation refeq:task- 
blk to describe task blocking for all of our proposed FGA models. Note that when 
Pi = Pi, our definition of task blocking agrees with the derived definition for the BGA 
LHM case. 

6.3 FGA System Model Parameters 

This section contains a notation summary of variables (parameters and/or statistics) 
used for the analysis of foreground aperiodic (FGA) scheduling assuming the trans- 
formed model illustrated in Figure 6.1 is a reasonable approximation. Some of the 
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notation has been previously defined, and some has yet to be defined. 

B k = blocking time in hyperperiod k y 0 < B k <C 
u> 0 = fraction of hyperperiods for which B% = 0 
oj b = 1 - uj q 

B = average of all 

p x = effective periodic utilization = J3/f ~* 

p = effective utilization, = p x + p% 

Bt> = average blocking time over all Bk > 0, (6.11) 

A = ^B b {ti 2 - A2)- 1 = fi 6 (i - piY x 

0 = average blocking rate over all Bk > 0 

o)o = task blocking probability =1 — pi(l — P2Y 1 

Xq = x 0 = -ln(l - cDo)(M2 - A 2 ) _1 

A =^2(1 - pi) 

For model selection, minimally a value for uo (or equivalent) must first be observed. 
A value for B also is often required. 

6.4 Short Hyperperiods 

As one might expect, when B « C, the foreground aperiodic response times don't 
differ much from the background aperiodic response times. When B w C, then 
/5i = 2?# -1 & CH~~ l = px and very similar criteria used in Chapter 5 can be applied 
here. Conjecture 6.4.1 defines conditions when the DSM can be used to describe the 
response time distribution of the aperiodic tasks. 
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Conjecture 6.4.1 (FGA SHM Conditions) For foreground aperiodic scheduling, 
when the expected number of aperiodic arrivals during an average (DSM) response 
time is greater than or equal to the expected number of aperiodics discharged in a 
hyperperiod, the DSM provides reasonable estimates. 
More concisely, the DSM is reasonable when 



/\<2 

(mi -Pi) - A 2 ) 



or equivalently, when 



\ > M2(l-Pl)(l~^l) M 1 ~ pi) 

2 - (ffMa)-i + (1 - pO 1 + (i«l - pO)- 1 ' 



(6.12) 



4 slightly more conservative but data independent test assumes p\ = 0 and is given 
in Equation 6. 13. 

P*(l- Pi) 



As expected, the DSM applies to the foreground aperiodic model for only a subset of 
the system parameter ranges for which the DSM applies to the background aperiodic 
model. The data sampled suggests the use of XI in Equation 6.13 is reasonable if 
not preferable. Table 6.2 contains sample data with observations for several different 
system configurations. In the quantile column is the approximate quant ile where the 
predicted and observed curves noticably diverge. To that point, the fit varies among 
samples. 
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H 


C 


A 2 


B 


h 


> A m 


>Kn 


DSM fit 


quantile 


4 


1 


0.70 


0.85 


0.212 


.569 


.600 


very good 


x(.994) 


8 


2 




1.59 


0.199 


.649 


.667 


moderate 


x(.95) 


32 


8 




4.70 


0.147 


.723 


.727 


moderate 


x(.95) 


64 


16 




6.66 


0.104 


.737 


.738 


poor 




128 


32 




8.53 


0.067 


.744 


.744 


very poor 




8 


2 


0J4 


1.92 


0.240 


.644 


.667 


very good 


x(.995) 


16 


4 




3.78 


0.236 


.693 


.706 


good 


x(.95) 


32 


8 




7.14 


0.223 


.721 


.727 


moderate 


x(.95) 


64 


16 




13.09 


0.205 


,736 


.738 


poor 




128 


32 




24.39 


0.191 


.743 


.744 


poor 


x(.80) 


4 


3 


0.20 


2.03 


.508 


.166 


.200 


very good 


x(.99) 


8 


6 




3.57 


.446 


.204 


.222 


moderate 


x(1.00) 


16 


12 




5.48 


.342 


.238 


.235 


poor 




32 


24 




10.94 


.276 


.240 


.242 


very poor 




8 


6 


0.24 


5.43 


.679 


.180 


.222 


good 


x(.99) 


16 


12 




10.30 


.644 


.213 


.235 


good 


x(.95) 


32 


24 




18.94 


.592 


.232 


.242 


moderate 


x(.95) 


64 


48 




32.00 


.500 


.242 


.246 


poor 




128 


96 




48.90 


.382 


.247 


.248 


very poor 





Table 6.2: FGA SHM Selection Criterion Evaluation 

6.4.1 Response Time Distribution 

The arguments for Conjecture 6.4.1 are essentially the same as in Section 5.2.1. Re- 
sponse times means and variances can also be calculated using the techniques of 
observation 5.2.2. Figure 6.2 shows foreground aperiodic response time data when 
compared to the DSM. Figure 6.3 shows the corresponding Q-Q plots. 

6.4.2 Aperiodic System Size Distribution 

When hyperperiods are short the system size arguments in Section 5.3.1 can be 
adapted in the natural ways. This applies to system size distributions, as well as 
moments. Figure 6.4 shows sample data for aperiodic system size distributions when 
scheduled in the foreground discipline. Estimates are overly optimistic for small val- 
ues of queue length, and improve for longer values of queue length. Not surprisingly, 
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Figure 6.2: FGA SHM Response Time EDFs 
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Figure 6.3: FGA SHM Response Time Q-Q Plots 
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Figure 6.4: FGA SHM System Size EDFs 
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the overall fit for pi = 0.25 tends to be better than when pi = 0,75, and the overall 
fit improves as p increases. 

6.5 Very Long Hyperperiods 

When hyperperiods are very long, the aperiodic tasks essentially see the behavior 
of an M/M/l queue. Based on empirical observations, we conjecture that when the 
effective periodic utilization is near zero, then the M/M/l model is reasonable. This 
is formalized in Conjecture 6.5.1. 

Conjecture 6.5.1 (FGA VLHM Conditions) For foreground aperiodic schedul- 
ing, when the task blocking probability is less than 0.001, the M/M/l response time 
model provides reasonable estimates. 

In symbols, the M/M/l model applies when 

B (l-wo)Ba 0.001 

Pi = — = — - — < , or equivalently when cj q > 0.999. 

H H 1 — p2 

Obviously, this criterion is somewhat arbitrary and can be specified to the precision 
measured in practice. Note when u 0 « 1, pi « 0 for all systems with p2 96 1, which is 
consistent with intuition. 

Table 6.3 contains some observed sample values under various parameter set- 
tings. In the Q-Q plots (some of which are included next) there tends to be uniformly 
good agreement to the value Xq ) at which point there is often quite rapid divergence. 
Only when p x < 0.0005, are both the observed response time sample mean and stan- 
dard deviation fairly close to the M/M/l mean (m r ), and standard deviation (a r ) 
which is seen in Table 6.4. 
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u; p 
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o>o 


Pi 


MM-fit 


quantile 


4096 


1024 


0.99 


0.74 


0.516 


0.484 


123.63 


0.944 


0.012 


poor 


x(.95) 


8192 


2048 






0.727 


0.273 


108.47 


0.982 


0.004 


moderate 


x(.95) 


16384 


4096 






0.796 


0.204 


97.34 
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x(.995) 


32768 
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1536 


0.99 


0.24 


0.608 


0.392 


126.10 


0.968 


0.024 


poor 


x(.97) 


4096 


3072 






0.786 


0.214 


111.44 


0.993 


0.006 


moderate 


x(.993) 


8192 


6144 






0.906 


0.094 


104.65 


0.998 


0.002 


very good 


x(.998) 


16386 


12288 






0.959 


0.041 


149.38 


1.000 


0.000 


excellent 


x(1.0) 
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96 


0.95 


0.20 


0.749 


0.251 


23.17 


0.951 


0.008 


poor 
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0.947 


0.053 
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0.996 


0.003 


moderate 


x(.995) 
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0.993 


0.007 


36.00 
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0.000 
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x(1.0) 



Table 6.3: FGA VLHM Criterion Evaluation 
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fh r 




771 r — C? r 


4096 


1024 


0.99 


0.74 
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Table 6.4: FGA VLHM Response Time Moments 

6.5.1 Response Time Distribution 

This is the idealized M/M/l response time distribution where 

R{x) = 1 - e- (w_As)l for x > 0. 
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Figure 6.5 shows some sample response time distributions with an M/M/l response 
time overlay. Departures from the M/M/l are often not readily detectable from the 
response time CDFs, but are from the Q-Q plots which are shown in Figure 6.6. 

6.5*2 Aperiodic System Size Distributions 

The idealized M/M/l system size distribution where 

Pr(5 < k) = 1 - p£ +1 for k e {0, 1, 2, ...} 

tends to be somewhat optimistic. Figure 6.7 shows several sample system size dis- 
tributions with an M/M/l system state overlay for the same simulation runs in Fig- 
ure 6.5. Estimates are better for larger values of queue length. 

6.6 Long Hyperperiods 

The long hyperperiod model applies when some periodic blocking occurs, but blocking 
times are almost always limited to less than one periodic compute time. In other 
words, an aperiodic task's completion time contains one or fewer blocking periods, or 
equivalently when u m « 0. When these conditions hold, the blocking time distribution 
is modeled as £ (/?), where (3 is one of the DSM or HTM rate parameters. We formalize 
conditions when the long hyperperiod model (LHM) is a reasonable approximation 
in Conjecture 6.6.1. 

Conjecture 6.6.1 (FGA LHM Conditions) For foreground aperiodic scheduling, 
when the probability of maximum blocking is suitably small and the VLHM does not 
apply, the LHM can be used to predict response time data. 
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Figure 6.5: FGA VLHM Response Time EDFs 
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Figure 6.6: FGA VLHM Repsonse Time Q-Q Plots 



CHAPTER 6. MIXED SCHEDULING: FOREGROUND APERIODICS 



118 



FGA State EDFt; HTM -.; data ♦ ; Ml 1 DSU - 




FGA Slice EOF »; HTM data ♦; UU1 DSfcJ - 




x-am rt»W760; tambd*- 0 20. H- 2048. C» 16»&-1 



x-axfc* (n tyo; rt»l -0.260; tomMa- 0.70; H» 2049. 0 612; S-1 



FQASUteEDFt;HTW-.;«hU«;MM1 ..;DSU — 




FQA Stsc EQFt. HTU -. 




«-«»3»in«y»; rt»1-0760: 



20 90 40 50 60 70 00 

x-ax»*infly*; i*>1 -0190; lambda* 0. 74; K» 8182; O 2048,31.-1 



FGA Sou EOF*; HTM -.; data « ; UM1 




FGA State EOFt ; HTM 



x-«4*.*ifl wm ftol-4789: tanM*- 0.24; H- 1W84. C- 122W; 8L-1 




x-«6a:» in *y»; rtx>1 -0.260; wnnh- 0.74; H- 32708: 0 6102; SL-1 



Figure 6.7: FGA VLHM System Size EDFs 
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More concisely, the LHM is reasonable when 



Pr(B = C)=u m &0 and p x > 



0.001 



(6.14) 




or equivalently, when 



and pi > 



0.001 
1 ~ Pi 



(6.15) 



Empirical evidence suggests u m < 0.03 is suitably small. Under these conditions?, 
typically B b « j3~ l . 

In the next several sections, we derive a model for describing aperiodic response time 
distributions under the long hyperperiod model conditions. 

6.6.1 Aperiodic Queue Lengths during Blocking Times 

When the periodic task is blocking aperiodic tasks for less than the full periodic 
compute time, the aperiodic tasks are in a state of heavy traffic. At the end of 
blocking intervals, the aperiodic queue length distribution can be approximated using 
the heavy traffic models of Section 3.4.1. Our data again shows the degraded server 
approximations are slightly less optimistic and also match the data slightly better. 
Both approximations are given below. 

For the DSM, E[N d ] = AsOu-Ag)' 1 , and a d = (A 2 A)(m-A 2 )^ 1 . The distribution 
of aperiodic system size at blocking interval ends, N dtb , is given by Equation 4.2. For 
the HTM, let N^t be the heavy traffic approximation of aperiodic queue length at 

3 When u> 0 « 1, the number of hyperperiods observed can be small, in which case B b < 0~ l 
sometimes occurs. 
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A 2 


Pi 


E[N d ] 




0 M 




E'N h ] 




0h,c 


LHM region 


0.24 
0.74 


0.75 
0.25 


24.00 
74.00 


24.50 
74.50 


0.0417 
0.0135 


0.01 
0.01 


15.125 
65.125 


0.066 
0.015 


0.016 
0.011 


256 < H < 8192 
1024 < H < 16386 


0.20 
0.70 


0.75 
0.25 


4.00 
14.00 


4.47 
14.49 


0.2500 
0.0710 


0.05 
0.05 


! 2.625 
12.625 


0.381 
0.079 


0.076 
0.055 


64 < H < 512 
256 < H < 1024 


0.10 
0.60 


0.75 
0.25 


0.67 
4.00 


1.05 
4.47 


1.50 
0.25 


0.15 
0.15 


0.542 
3.875 


1.846 
0.258 


0.185 
0.155 


32 < H < 128 
128 < H < 256 



Table 6.5: Predicted Blocking Rates when u) m = 0 



blocking periodic departures. Then 



E[N h ) - 



M 1 -p 1 ) 2 + A 2 
2foa(l-Pi)-A a )' 



so from Equation 3.6 N htb ~ £(E[N h ]~ l ). 

Figure 6.8 shows aperiodic queue length distributions when sampled at the 
departures of blocking periodic tasks. 



6.6,2 Blocking Time Distribution 

When maximum blocking is rare (i.e. u; m « 0), the queue length is expected to 
grow linearly with blocking time. Specifically, during blocking times iV 2 A 2 B and 
hence fi c « A 2 /?«. Table 6.5 lists predicted blocking rates and mean blocking times 
for several system configurations for which u> m « 0, and hence for which e~& eC » 0, 

Conditional on some blocking occurring, the blocking time distribution is es- 
timated by £(0e)' I* 1 other words, Pr(jB < 6 | given some blocking occurs) is given 
by 

Pr(B < b\B > 0) = B p (x) « 1 - e~^ b = 1 - e~ 0b . (6.16) 
Table 6.6 lists several observed values of estimates of ft under a range of parameter 
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Figure 6.8: FGA LHM Queue Length EDFs at Blocking Periodic Departures 
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0c 
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5.722 
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0.049 
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0.043 
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32 
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0.60 
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0.098 


0.130 


0.002 


.15 
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64 






0.205 


0.002 


0.098 


0.000 




512 


128 






0.010 


0.001 


0.096 


0.000 




32 


24 


0.85 


0.10 


0.795 


0.088 


0.138 


0.004 


.15 


64 
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0.269 


0.022 


0.102 


0.000 




128 


96 






0.064 


0.004 


0.097 


0.000 





Table 6.6: Observed Blocking Rates when u; m .« 0 

settings. Figure 6.9 shows some blocking time distributions for the foreground aperi- 
odic scheduling discipline when u) m « 0. For the smaller utilizations (e.g. p = 0.85) 
and the longer hyperperiods, (3 C tends to be smaller than estimated. Under these 
conditions, only a small number of blocking hyperperiods are observed and shorter 
blocking times are more likely to be observed than longer blocking times. 4 

6.6,3 Response Time Distribution 

In this section, we develop a response time distribution approximation for systems 
satisfying the FGA LHM conditions. Define x 0 = Xq 0 = -ln(l — u>o)(^2 — A 2 ) _1 so 



4 For X u X 2 , ... iid £(/?), and X k = (J* ml X,)/fc, then P(X k < /3" 1 ) = P(G 0yk < k0~ l ) > 0.5 
for k small. 
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Figure 6.9: FGA LHM Blocking Time Distributions 
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that 



Rmixo) = 1 - 6 -(»-*»>*o _ £ 0 = UQ + (i _ Wo )(i _ 



-1 



# (1 - ft) 



)«1- 



where /? is the mean blocking rate conditional on some blocking occurring and u>o is 
the task blocking probability. Note that the long and very long hyperperiod response 
time models agree on [0,x 0 ]. 

Recall from Section D.l.l the development of the BGA LHM response time 
CDF when the blocking time distribution B ~ V(C). Define 



R{x\B>0) = f R(x\B = b)dB p (b) = [ R(x\B = b)/3 c e-^ 
Jo Jo 



where R(x\B = b) is the BGA LHM response time distribution. For ease of exposition 
in this section, let a denote [if (1 - p2)]~ x - Then, 



R{x\B = 6 > 0) = < 



0 for x < x 0 

1 — a(b + Xq — x) for x G [x 0 , b + Xo] 
1 for x > b 4- x 0 



(6.17) 



For x G [xo, xo + 6], then b G [x - Xo, C]. For x > xo + 6, then 0 < b < x — xq. 
Just as for the BGA LHM, we assume the support is non-zero only for x G [xo, xq+C]. 
Thus, 

R(x\B > 0) = / 1 ■ dB p (b) + (1 - a(x 0 - x + b))dB p (b), 

JO J(x-x 0 ) 
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where B p (b) is defined in Equation 6.16. Further computing gives 

R(x\B > 0) = (1 - e-^ 1 - 1 ")) + (1 - a{x 0 - x)) g_ Xf> e'^db 
-a/3 f c be- 0b db 

~ JX — XQ 

V V n (6.18) 

« 1 - ctp- x erft*-*ti when e~ pc « 0. 

Note that for suitably long C, i?(xo|B > 0) « 1 — F^H" 1 ^ which is the probability 
an individual task (within a hyperperiod) will not block given some blocking occurs 
in that hyperperiod. For a task that experiences no blocking, the response time 
distribution is that of an M/M/l queue for r < x 0 . In a non-blocking hyperperiod, 
we make the simplifying assumption Pr(R > x Q \B = 0) = 0. For suitably long 
hyperperiods, this approximation can be justfied as follows. A bit of reflection reveals 
that Pr(fl < x\B = 0) > Pr{R < x). So for x > x 0i u 0 <Pr(R < x) < Pv{R < x\B = 
0). Once observing u> 0 < &o an application of Conjecture 6.2.1 gives u>o < u>o t 1 as 



Assuming all this, for x € [x 0 ,C + x 0 ], ft 1S approximated by the mixture 
R { (x) = v 0 • 1 + (1 - w 0 )(l - a/J-V^-* 0 *). By assumption, e~ pc « 0, so i^s) = 1 
for x > C + x 0 . This approximation is summarized in Equation 6.19. Once observing 
that e ~P( x ~ x °) is the probability that a new arrival sees a blocking time in excess of 
x — Xq, the similarity in form of Equation 6.19 with Equation 5.9 is noteworthy. 



tf too. 



0 



for x < 0 



ft(x) = < 



1_ c -(w-a*)« forxG [0,x 0 ] 

1 - - P2)]- I e^^°) for x € [a: 0 , C + x 0 ] 



(6.19) 
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Figure 6.10: FGA LHM Response Time EDFs 



CHAPTER 6. MIXED SCHEDULING: FOREGROUND APERIODICS 



127 



FGA Resp True O-Q Piatt: OSM:- -; WW-. ; LHM: - ; VLHM ... FQA Rexp Time O-O Plots; OSU - -; MW-. ; IHM. VLHfci ... 




0 & 10 15 20 23 0 5 10 15 20 25 30 » 

*F*MP): twntxto - 0.100; H - 04.000; C - 46 000; wQt » 0 »7: (1»1i - 0 002 *ttwfl»; tefrtxto - 0000; H - 1 28.000: C « 52,000; wOl « 0.086; moll - 0.008 



FGA Rap Tme O-Q Ptau. DSM- -; WK ; LHM: -; VLHM ._ f OA Reap Ttae Q-O Ptats, OSM - - ; WM-. : LHM.- -; VIHM:... 




KFttwoa; t«*da " 0.240: h * 6i 2.000. c - sm.ooo: wot - 0020 mon - 0.1 » x:WnsKp): tamMa - 0 7+0; h » 2046.000; c - 612.000: vrt* - 0 804: moii » 0.028 



Figure 6.11: FGA LHM Response Time Q-Q Plots 
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Figure 6.10 shows several sample response time distributions plotted against the long 
foreground aperiodic hyperperiod response time model. Figure 6.11 shows the corre- 
sponding response time Q-Q plots. 

6-7 Intermediate Hyper periods 

System configurations that fall in the intermediate hyperperiod category have each 
of ix>o,u)p and u m 96 0. The number of response time bands will be two or greater. 
With the exception of the lowest band where some M/M/l behavior can be exhibited 
throughout the hyperperiod, the response time curves fall strictly within the bands 
just as they did for background aperiodics. 

The intermediate hyperperiod model (IHM) we are about to propose applies 
over a broad range of conditions, and when near the long hyperperiod model condi- 
tions (i.e. u> m » 0) is comparable to and slightly more optimistic (in the right tails) 
than the long hyperperiod model. The intuition behind the development of the FGA 
IHM response time distribution is that there is some M/M/l behavior, but otherwise 
the response times are governed by the DSM model. When the FGA IHM and FGA 
LHM distributions are close, we choose the LHM since it is more conservative. 

Conjecture 6.7,1 (FGA IHM Conditions) In the foreground aperiodic schedul- 
ing model, the IHM is a candidate estimator of response times when A 2 satisfies 
Equation 6.20. 

A 2 < ^pr^l ^d u; m > 0.03. (6.20) 

The left hand condition is simply the negation of the SHM conditions in Conjec- 
ture 6.^.1 and similarly the right hand condition is the negation of the LHM conditions 
given in Conjecture 6.6.1. 
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Figure 6.12: FGA IHM Response Time Data Bands 
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Figure 6.13: FGA IHM Response Time EDFs 
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Figure 6.14: FGA IHM Response Time Q-Q Plots 
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Figure 6.12 shows some sample response time bands, with response times as 
the y-axis, and time of arrival within the hyperperiod as the x-axis. Equation 6.21 is 
an approximation for Ri{x)> the probability that the response time does not exceed 
x in an arbitrary hyperperiod. 

Ri(x) = 1 - Q 0 e- { ^- X2)x - u> b e-&- x *) x . (6.21) 

The derivation is easy. For the task that experiences no sources of blocking, 
the response time distribution is that of an M/M/l. For the task that experiences 
blocking, we approximate the response time distribution as a DSM. This leads us to 
a mixture for a response time distribution, 

Ri(x) = u> Q R m (x)+u; b R d (x) 

= d/ 0 (l - e-(* 2 - Aa > x ) + Q b (l - e -^~ A2 > x ), 

where Rm is the M/M/l response time distribution and R d is the DSM response time 
distribution, which can be rewritten as Equation 6.21. 

Figures 6.13 and 6.14 contain sample EDFs and Q-Q plots, respectively. The 
FGA IHM does not fit as well as the BGA IHM, but it also uses much less of the 
data (or equivalent) in its formulation. For pi — 0.75, the BGA IHM appears to 
fit moderately well. When p > 0.95, the IHM is often optimistic in the midrange 
quantiles when p\ = 0.25. Equation 6.21 appears to work better for heavy traffic 
(e.g. p = 0.99) and tends to be optimistic for lighter traffic. 



Chapter 7 



Future Work 

This thesis has looked at a few very special cases of the larger problem of predictably 
scheduling multiple sources of traffic, each with different service requirements, possi- 
bly including hard deadlines (which was the case for our periodic tasks). The general 
problem is not limited to a single server, but includes analyses for networks of queues. 

Even for the special cases considered, we found we needed a collection of models 
to describe the response time (and in some cases system size) distributions of the 
aperiodic tasks. Within this collection, several models' and/or their selection criterion 
parameters were estimated using simulation data and not analytically derived. We 
begin by reiterating the analytically unknown parameters and conclude with a list of 
system generalizations which are candidates for future work. 

7.1 Unknown Parameters 

For serveral cases, we reduced the problem of analyzing certain aperiodic performance 
measures in a hard real-time system from one of determining entire distributions to 
one of selecting a parametric model. In a few cases, the parameters were analyti- 
cally derived, but there are several cases where observed values for model parameters 
and/or model selection criteria were used. Finding mathematical descriptions for the 
model parameters enumerated below remains a challenge. 

1. All models or selection criteria for the FGA scheduling discipline make (either 
direct or indirect) use of the value of ojq — Pr(no blocking within a hyperperiod 
occurs), which must now be empirically observed. 

133 
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For the M/M/l (VLH) model, the criteria is uj 0 > 0.999. For both the IH 
and LH models, too = 1 — Pi(l — P2)" 1 is & part of the response time equation. 
For the SHM, p x can be used in the evaluation criterion, where p\ = BH~ X = 
(l-wb)Atf- 1 . 

2. For the FGA scheduling discipline, the selection criterion boundary when choos- 
ing between the IHM and the LHM makes use of the value of u m = Pr(maximum 
blocking within a hyperperiod occurs), which must be observed. In some sense, 
blocking times of zero and C represent boundary conditions when the blocking 
time distribution is captured by a diffusion process. This suggests a solution 
technique for finding cuq may also be applicable for finding a/ m . 

3. For the FGA scheduling discipline, the selection criterion boundary when choos- 
ing between the SHM and the LHM makes use of p\ = BH~ X . In this case 
u m 96 0, so B b 96 /J" 1 , hence B (or equivalently) must be observed. 

4. For the BGA scheduling discipline, the IHM, the vector P* had to be observed. 
We proposed obtaining these observed values via process/system simulation. 
This has the disadvantage that the values obtained need not be representative 
of a randomly chosen sample path. In particular, our values were exact since we 
used system simulation data to obtain them. Obtaining the values using process 
simulation potentially suffers from being sensative to initial state configurations 
and/or insufficient data for a good estimate (although not usually a problem in 
simulation). 

An alternative potentially viable approach might be to characterize the lim- 
iting distribution of the completion time process (which we have begun with 
our characterization of the blocking time distributions) and model the system 
as an M/G/l queue. As we have seen, in principle we can obtain the LaPlace 
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transform for the response time distribution in terms of the completion time 
distribution. However, inversion of LaPlace transforms can require advanced 
techniques in complex analysis and/or numeric (inversion) methods. Once in- 
verted, the integration to obtain the cdf might introduce additional numeric 
approximations. Also, any individual solution need not shed light on similar 
but different parameter settings. 

In addition to requiring observations for model and/or model selection criteria 
parameters, sometimes models either did not fit well or no models were proposed. 

1. For the FGA scheduling discipline, in the IHM range, the fit was not particularly 
good for "moderate" traffic. An approach similar to our background aperiodic 
IHM, using linear piecewise estimation for the response time bands would likely 
have produced better fitting models in non-heavy traffic. Since our focus was 
on heavily loaded systems and having done something very similar for the BGA 
IHM case, we opted to restrict focus on the applicability of models with closed 
form expressions. 

2. In all cases, our investigations were restricted to p\ G {0.25,0.75}. We suspect 
that similar criteria will hold for p x 6 [0.15,0.85], and probably more broadly. 
These two values were chosen with the hope of capturing the impact of periodic 
task scheduling on aperiodic tasks when neither class largely dominates the 
system. 

3. For the BGA scheduling discipline, system size models were not developed for 
the IHM case. For the FGA scheduling discipline, except for VLHM and SHMs, 
system size models were not developed. 

The model boundaries defined by the model selection criteria will never be 
sharp, since in some sense the models are (topologically) close to one another near 
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the boundary conditions. However, in several cases our selection criteria were based 
on observations of parameter values with little more than an appeal to the intuitive 
reasonableness of their plausibility. With a more precise formulation (and greater 
understanding) of the model parameters, better motivated model selection conditions 
are likely to result. 

7.2 System Model Generalizations 

For the greatest chance of tractable solutions we have made the simplest of all as- 
sumptions about periodic and aperiodic properties for our system description. In all 
applications we have encountered, systems are considerably more complex than what 
we have assumed. In what follows, we sketch several complicating assumptions quite 
likely to occur in practical situations. We restrict focus to a single server, although 
the predictable behavior of networks is also of great interest in practice. 

7.2.1 Multiple Periodic Streams 

In hard real-time systems there are typically many periodic tasks which may have 
different periods. Perhaps the best known preemptive fixed-priority scheduling al- 
gorithm for periodic tasks in hard real-time systems is Rate Monotonic Scheduling 
(RMS,[27]). In RMS, there is a set of n periodic tasks with periods T = (7\, T 2 , T n ) 
and compute times C\ = (Ci,!, Ci >2 , Ci,n). The execution of the periodic tasks cy- 
cle every hyperperiod (if), where H is the least common multiple of the periods, or 
H = lcm{Ti,T 2 , ...,T n }. The periodic utilization is defined by, p x = C hj T~\ 

When scheduling aperiodic tasks in RMS systems, a background aperiodic 
policy was often adopted. Within the last decade, several algorithms have been 
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devised to give improved response time to non-critical aperiodic tasks while guaran- 
teeing the hard deadline requirements of the periodic tasks. In our opinion, the slack 
stealer ([24]) provides the most robust design basis for algorithms in this class- To 
our knowledge, mathematical descriptions of aperiodic task behavior when scheduled 
using these algorithms remains largely undeveloped. When there is a single peri- 
odic task, our foreground aperiodic scheduling discipline is a special case of the slack 
stealer (and also of an algorithm known as the sporadic server). 

We now consider the effects of multiple periodic tasks on aperiodic response 
times when compared to a single periodic task for fixed H and pi (we are considering 
more than just RMS). As an illustration, consider an example with which we are 
already familiar. Figure 7.1 shows cumulative aperiodic timeline availability as a 
function of time for two different task sets. The solid line corresponds to a single 
periodic task with period H x and compute time C\. The dot-dash line also corresponds 
to the case a single task with period #2 = Hi/m and compute time C2 = Ci/m, for 
m a positive integer. An alternative view of the dot-dash line is there are m tasks 
evenly spaced tasks throughout a hyperperiod of length Hi each with compute time 
C2 = Ci/ra, The bottom lines are the exact aperiodic timeline availabilty that would 
be seen by an aperiodic stream when using the BGA scheduling discipline. The top 
lines are the maximums of the aperiodic timeline availability when using the FGA 
scheduling discipline. The maximums occur only when the system is constantly busy. 

Let Si be a schedule with a single periodic task, and let S2 be a schedule 
with multiple periodic tasks but with equal H and p\. It is an easy exercise to show 
that for any aperiodic arrival process, when using a foreground aperiodic scheduling 
discipline, aperiodic response times in Si will be smaller than aperiodic response times 
in 52 for the simple reason that the maximum aperiodic timeline in Si is no less than 
the maximum aperiodic timeline in 52. Note that in the absence of aperiodic arrivals, 
the aperiodic timeline will decrease, and Si also has the ability to decrease more than 
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Figure 7.1: Aperiodic Timeline Availability 

S2 implying that the available slack in 5i is always greater than that in S 2 . Hence, 
our response time results for FGA scheduling are optimal compared to all other fifo 
aperiodic disciplines with equal if, p\ and guaranteed hard deadlines for periodic 
tasks. 

One might initially think a similar argument applies to S\ and when using 
the background aperiodic scheduling discipline. However this need not be the case 
since both the time of aperiodic task arrivals and the amount of pending work de- 
termine response times. It is easy to construct sample paths for which the response 
times in Si are better than those in S 2 * For example, consider an amount of aperiodic 
work W a , H 2 - C2 < W a < Hi - Ci arriving exactly at time C\. In Si, no blocking 
occurs and in S2 some blocking occurs. One must average over all sample paths and 
look at the limiting distributions to decide if Si performs better than 5 2 in terms of 
providing a stochastically smaller response time distribution to aperiodics. Despite 
this counterexample, we suspect that the BGA LHM is pessimistic compared to other 
models with equal hyperperiods and equal periodic utilization when averaging over 
all sample aperiodic arrival paths (perhaps assuming that in the long run, aperiodics 
will arrive roughly equally throughout the entire hyperperiod). 



CHAPTER 7. FUTURE WORK 



13& 



When even spacing is applied in Si as illustrated in Figure 7.1, the LHM is 
transformed to a DSM asmf. Suppose ^2 = 1 and consider the BGA SHM criterion 
rewritten as 

a< (1- Pl )(l-X 2 Y 
Alternatively, rewrite the BGA LHM criterion as 

Mi-A 2 ) 

(l-\ 2 -p x )r 

Assuming the validity of these two criteria, this tells us that if under configuration Si 
the LHM holds, then we can find an integer m such that 52 becomes the DSM analog 
of Si. Recall that when the BGA LHM holds, the DSM was much too optimistic. 

Looking at non-uniform distributions of periodic execution timelines remains 
for future work for both foreground and background aperiodic scheduling disciplines. 
Another practical area for investigation is allowing the periodic exectuion time to be 
random but bounded (by C). Unused periodic compute time can be reclaimed and 
reallocated to aperiodic tasks. However, an analysis using the worst case periodic 
compute times will provide a stochastically larger aperiodic response time distribution 
for a single server. 

7.2.2 Different Aperiodic Interarrival/Service Distributions 

We have assumed that both the aperiodic interarrival and service time distributions 
are exponential. If all our arguments were based on heavy traffic theory (rather 
than the degraded server model for a fifo M/M/l queue - which we often opted for 
because they were slightly more pessimistic in heavy traffic and much more accurate 
in moderate traffic), then we suspect similar arguments and results would apply under 
conditions of heavy traffic. 
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The only calculation used that made very explicit use of the the M/M/l as- 
sumption was the mean return time to the M/M/l interval in the BGA LHM. When 

there are no changes in the BGA IHM, and the BGA DSM would instead use 
parameters derived using heavy traffic theory. Our analysis of the FGA scheduling 
discipline makes use of the known response time distribution for the M/M/l queue 
when describing task response times that do not block. Since these tasks are not in 
heavy traffic, a heavy traffic approximation is not appropriate in this region. Any 
techniques applicable for estimating response time distributions in the absense of pe- 
riodic processes are applicable for this region, which does not define the right tails of 
the response time distribution. 

7.2-3 Multiple Aperiodic Streams 

We considered only a single source of aperiodic traffic. In practice, there might 
be multiple sources of aperiodic traffic. If there are priorities among the aperiodic 
streams, the analysis would proceed first for the highest priority aperiodic stream, 
and then for the next highest priority aperiodic stream, etc. If all aperiodic streams 
have lower priority than a single periodic stream, then depending on total utilization, 
the highest priority aperiodic stream might be analyzed using techniques developed 
here when the criterion hold. If the system is near saturation, the lowest priority 
stream will be in a state of heavy traffic and might be analyzed using heavy traffic 
theory. 

When there are two or more independent aperiodic streams, each with a Pois- 
sion arrival process and fixed priorities, a completion time distribution for low(er) 
priority aperiodic traffic using techniques similar to those presented in [17] can be de- 
rived. To develop the response time analysis for the low(er) priority aperiodic stream 
in a system with a periodic traffic stream and higher priority (or priorities) aperiodic 



CHAPTER 7. FUTURE WORK 



141 



traffic, the completion time distribution would be used to describe the low priority 
aperiodic traffic's service time distribution. When not all aperiodic arrival processes 
are Poisson and service distributions are general, a tractable exact analysis (in the 
absense of periodic traffic) does not exist in any general case. Nonetheless, determin- 
ing sensitivities to the various assumptions would be useful when estimating response 
times for real applications. 

7.2.4 Different Aperiodic Service Disciplines 

By assuming a FIFO aperiodic task queue, aperiodic tasks are served in order of 
arrival. It is a more challenging problem when individual tasks are allowed to have 
distinct deadlines (drawn from a common distribution) , in which case a more optimal 
scheduling discipline might assign (dynamic) priorities (within a class) giving highest 
priority to the task with the shortest time to deadline to minimize the number of 
missed deadlines. 

The well known earliest deadline first (BDF) scheduling discipline assigns high- 
est priority to the task with the smallest deadline and uses preemption to ensure that 
the task with the highest priority is receiving service. The optimality of EDF has long 
been known ([27]) to minimize the number of missed deadlines. Still, the response 
time distribution for tasks with individual deadlines remains unsolved except in very 
special cases. 1 

An alternative to EDF is the least laxity (LL) scheduling discipline which 
is non-preemptive (among aperiodic tasks). Each task has a laxity, which is the 
maximum time it can wait before beginning execution. The LL discipline will select 
the task with the smallest laxity only at aperiodic task departure times (assuming 
the server is busy; arrivals finding an idle server are started right away). Note that 



When all task deadlines are the same constant, then fifo is the same as edf.[25] 
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the LL as defined, does not make use of execution times in its decision rule for which 
task to schedule. Certain optimality properties about LL are also known ([30]). 

In more recent work ([25], [26]), a closed form distribution for leadtimes when 
conditioning on a fixed queue length is derived for the EDF scheduling discipline under 
conditions of heavy traffic. The leadtime of a task is the deadline minus the current 
time. If the queue length distribution were known, then the unconditional leadtime 
distribution could be calculated which could be used to advise the developer of the 
probability that an individual task would miss its deadline. There axe no deadline 
guarantees for select traffic in this model. Our (partial) queue length analyses and 
region selection criteria might be useful in determining when the conditional leadtime 
analysis results could be expected to apply to mixed hard deadline periodic and 
aperiodic sheduling problems, and perhaps to approximate unconditional leadtime 
distributions. 



Appendix A 



Notation 



Table A.l summarizes performance variables, their distributions and parameters used 
throughout this thesis. Much of this notation is first introduced (in the context of a 
single class of traffic) in Chapter 2. 



Notation 



Description 



'*,n 

H 

C 

At 

Bi{x) 
Mi 

772 i 

<i 
Pi 

1 



The n tn task of class i to arrive to the system. Periodic tasks have i = 1 and 

aperiodic tasks have i = 2. 

The (hyper)period of the periodic tasks. 

The compute time a periodic task. 

A random variable defining the n th interarrival time within the class i arrival 

stream. {T itn } forms an iid sequence of random variables. 

The class i interarrival time distribution. Pr(T^ n < t) = Ai(t). 

The class i interarrival rate. E[T ifn ] = (AO -1 . 

The class i interarrival time variance. Vax[Ti jn ] = a\ { , 

A random variable defining n th class i service time. {A^ n } forms an iid se- 
quence of random variables. 

The class i service time distribution. Pr(Xi in < x) = Bi(x). 

The class i service rate. E[Xi }Tl ] = (a**)" 1 - 

The mean service time for a class i task. E[X^ n ] — m*. 

The class i service time variance. Var[X<, n ] = a\ v 

The class i utilization, pi = A*/*" 1 . 

The system utilization, p = pi + p2. 



Table A.l: Performance and Limiting Distribution Variables 



State variables are shown in Table A.2. It is common to use the same mnemonic 
designations for process state variables and for limiting distribution variables (since 
they will often converge to a random variable defining the steady state distribution). 
The context of their use should make their meaning clear. State variables first ap- 
pear in Chapter 2 in the context of a single class of traffic. They are later defined 
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for multiple classes in Chapter 3. The bottom portion of Table A.2 defines some 
centered and scaled random functions used in weak convergence arguments. Random 
functions defining the n th in a converging sequence of processes are superscripted. 



Notation 



Description 



Mt) 

N(t) = N t 
Bi(t) 

Xi(t) 
Idt) 

Di(t) 

W) 

Qi(t) 

Q(t) = Qt 
Wi(t) 

W(t) = w t 



A class identifier index. Typically, for i - 1 the class is periodic, and for i — 2 
the class is aperiodic. 

The number of class t arrivals in [0,tj. Ai(t) = max{fe € 

{o.i.2,..-}|E;»ir w <e}. 

The number of tasks in the system at time t. N(t) = Ni(t) + N 2 (t). 

Bi(t) = max{fc € {0, 1,2, ...} | Xij < t}> The number of class i departures 

in [0, t] given full processing capacity. 

The cumulative class i workload requested in [G,<]. Xi(t) = J^fjjjp Xij- 

The cumulative class i idle time in (0, t) which is the amount of time available 

for class i tasks during which no class i customers were present. 

The number of class i departures in [0,t]. Di(t) = Si(t - Ii(t)) and D 2 (t) = 

-MO)- 

The number of class i tasks in the system at time t. N{(t) = Ai(t) - D{(t), 
when Ni(0) = 0. 

The number of class i tasks queued at time t. The relationship between Ni(t) 

and Qi(tf) varies with different priority schemes. 

The number of tasks queued at time t. Q(t) = Qj(t) + Q 2 (t). 

The unfinished class i work at time t. With multiple classes, the interpretation 

of a virtual waiting time must be adapted. 

The unfinished work at time t. W(t) = W x (t) + W 2 {t). 



B? 



A? = n-*[Ai(nt) - XiUt], the n th scaled and centered class i arrival counting 
process. 

X? = n~^[Xi(nt) - pint], the n th scaled and centered class i cumulative work- 
load request process. 

SJ* = n~£[]£^i - to,)], the n th scaled and centered class i service time 
process. 

Bf = n"3[J3i(nt) - ^nt], the n tA scaled and centered class i arrival counting 
process. 



Table A.2: Notation: State Variable Descriptions 



Appendix B 

The M/M/l Queue 



Table B in this appendix lists many well known facts about the M/M/l Queue, most 
of which have been used in this thesis. Most all of the listed results can be found 
in [35] (Takacs). Alternative references for many of these results are [1], [21] and [22]. 
A single theorem is also stated, since it is referenced from Appendix C. 

Lemma B.0.1 (The mean and variance of a busy cycle.) The mean of an idle 
period is and the mean of a busy period is (fi — A)" 1 . So the mean of the busy 
cycle is 

E(BC) = M 



a(m-a)' 

The variance of an idle period is A" 2 and the variance of a busy period can be 
shown to be (l-J-p)[ju 2 (l — p) 3 ]" 1 ([SI]). Since the busy and idle cycles are independent 
(by the assumption of Poisson arrivals), the variance of the busy cycle (an adjacent 
busy and idle period) is 

Vai(BC) ~ AV(l-/t>) 3 ' 
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Notation 


Description 


A 

Ait) 
H 

Bit) 

m 

P 

Q(t) 

N(t) 
E{N) 
Var(JV) 
1 

E(l) 
B 

E(B) 

R(t) 

E(R) 

Var(fl) 

W(t) 

nk 

E[k^k~ 1] 


The average arrival rate. 

The distribution of the interarrival process. For the M/M/l, A(t) = 1 - e~~ xt . 
The arrival process forms a renewal process. 
The average service rate. 

The distribution of the service time process. For the M/M/l queue, B(t) = 
1-e-**. 

The average service time. This notation is used for more general processes 
(with non-exponential service times), m = 

Traffic intensity or utilization, p = A/*"" 1 = Am. Sometimes we denote p by 
U for utilization. 

The queue length process as a function of time. 

The number in the system process as a function of time. As t oo, P(N < 
n) = l-p n+1 . 

The expected value of the equilibrium queue length process. E(N) = - 
A)- 1 = p(l - p)~K 

The variance of the equilibrium queue length distribution. Var(JV) = 
(A/i)(^-A)- 2 = p(l-p)- 2 . 

A random variable describing an idle interval of the equilibrium process. For 

the M/M/l, I ~ exp(A). 

The expected length of an idle interval, A -1 . 

A random variable describing a busy period of the equilibrium process. For 
the M/M/l, the distribution of B is known. See [35]. 
The expected value of a busy period duration. E[B] = (/i - A)" 1 . 
The expected number served in a busy period. Ni> — ///(/* - A). The distri- 
bution is also known. 

The equilibrium response time distribution for a fifo queue (includes waiting 
plus service). R(t) = 1 - e~^" A ^. 

The mean of the equilibrium response time distribution. E(R) = (fi — A) = 
Ai-l(l-p)- 1 . 

The variance of the equilibrium respone time distribution. Var(il) = (/* - 
A)- 2 = M- 3 (1-P)- 2 - 

The equilibrium waiting time distribution for a fifo server. W(t) = 1 - 

The probability the system is in state A;, or equivalently, contains k tasks 
where A: G {0, 1,2, ...}. n k = (1 - p)p k . 

The mean time to first transition from state k to state fc-l,fce{l,2,3,...}. 
-!] = (/* -A)- 1 . 



Table B.l: M/M/l Steady State Distribution Variables 



Appendix C 
Limit Theorems 



C.l CLT for Renewal Processes 

Lemma C.l.l (Central Limit Theorem for Renewal Processes.) LetXi, Xz, 
X$,... be independent and identically distributed with E[Xj] = p, and Var[Xj] = a 2 . 
Define Sj = J2 m =i-Xm- Let N(t) be the counting process for the sucessive times 
between the Xj. In symbols, N(t) = max{j : Sj < t}. 
Then, 



As an application of the CLT for renewal processes, consider an approximation 
for the number of idle intervals in an M/M/l queue during a hyperperiod of length 
H. Then, 

m = — + 



A (ji-X) A(/i-A)' 

• 1 (1 + p) M 2 (l-P) 3 + A 2 (l + P) 
A 2 p?(l - P) 3 A 2 /u 2 (l - P? 

so to approximate N H , the number of idle intervals in [0, H) (or [kH, (k + l)H) for 

k E {0, 1, 2, ...}) when H is large we have 

N(H) ~ M(HX(1 - p), HX{1 -3p + V)). 
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C.2 The PASTA Property 



Lemma C.2.1 (The PASTA Property.) PASTA stands for "Poisson Arrivals See 
Time Averages". More technically, let S = {S(t)\t > 0} define the system state. Let 
B be an arbitrary collection of states in 5. Let P = {P(t)\t > 0} be a Poisson process 
with rate A > 0. In applications considered in this thesis, P will generally be the 
arrival process of aperiodic tasks to the system defined by S, but it need not be an 
arrival process at all 

We say that S is nonanticipating with respect to P if for every t > 0, {P(t + 
u) - P(t)\u > 0} is independent of both {S(v)\0 <v<t} and {P{v)\0 <v<t}. 

Let U{t) = [S{t) € B), that is U(t) is the indicator function of S{t) in the set 
B. Also, define 



Then, when U is nonanticipating with respect to P and has left- continuous 
sample paths with right limits, U(t) — ► U(oo) w.p.l if and only if A(t) £7(oo) 
w.p.l, as t -» co. 

In practical terms, this theorem says that the fraction of Poisson events ( ar- 
rivals) that find (see) S € B (i.e. A(t)) is the same as the time average of the system 
in state B (i.e. U(t)). 

For a proof of the PASTA property, see [38]. 





and 




An application of the PASTA property gives the virtual waiting time distribution for 
Poisson arrivals is equal to the actual waiting time distribution. 
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C.3 Brownian Motion 

Let $ t be a Weiner process with density 




Lemma C.3.1 (Brownian Motion with Reflection at Zero,) LetB% be a Brow- 
nian motion with drift m < 0 ; variance a 1 and with a lower control barrier that 
behaves like reflection at the origin. For B t > 0, we have B t ~ N(mt,aH). Let 
M t = sup 0<s<t B s . It can be shown that Mt B t [14]} Let t — > oo, so M^ ~ B^. It 
is well known ([2], [14], [20]) that the limiting distribution of Mt is given by 

lim Pr(M t <x) l-e 2mx ' a \ 

t-*OQ 

hence 

lim Pv(B t < x) l-e 2 ™'" 2 . 

t->oc 



Lemma C.3. 2 (Linear Combinations of Weiner Processes.) Let and £ 

be Weiner processes with £i and £2 independent Then for constants c\ and c%, 

Cl£l+<^2~(c? + <3)*£ 

The result is an easy exercise. 



1 This is not obvious. See Section t.9 of [14]* 
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Virtually all the work referenced in this thesis that makes use of weak convergence 
arguments to find limiting distributions are based on Donsker's Theorem. We have 
included a statement of a version of Donsker's theorem for the sake of completeness. 

Lemma C.4.1 (Donsker's Theorem.) Let D = E>[0,1] be the space of functions 
on [0,1] with discontinuities of at most the first kind. Let Fi,}^, ... be iid random 
variables defined on (fi, 0,P) with partial sums S n = Y\ + Y2 + ... + Y n . Further 
suppose, E[Y n ] = 0 and Var(y n ) = o 1 < 00. Define X n at each time t by 

Xn{t ' u) = ^ S[nt]{u) - 

Note that for each u G Q,X n (u) e D. Then lim n ^ooX n (<) & where & is a Weiner 
process, or equivalently & ~ Af(0, t). 

Proofs of Donsker's theorem can be found in [4], Theorems 10 (pg 68) and 
16 A (pg 137). Donsker's theorem is also known as the invariance principle and as 
the functional central limit theorem. 

Many extensions to Donsker's theorem have been made. For example, specifi- 
cation of non-zero means, relaxations of the iid assumption for Fi, K 2 , ... and increases 
in the domain of D (e.g. D[0, 00)). These modified versions have been applied and/or 
developed to obtain weak convergence results for queueing systems in heavy traffic. 

In all cases (we used), a sequence of scaled processes is defined where time is 
scaled by n" 1 and space (e.g. queue size) is scaled by n~ l l 2 in the n th process. We 
denote the n th (scaled) process at time t by X$ = X n (t). As n 00 the limiting 
diffusion is denoted by X°°(t) = X(t) ~ The scaled sequence is superscripted 
leaving subscripts for task class. 
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Examples using these scalings applied to queues can be found in [5], [13], [15], 
[16], [18], [25], [26] and [37], to cite just a few. The limiting process (as n -+ oo) defines 
the diffusion process for which we seek an equilibrium solution (when it exists), subject 
to suitable boundary conditions. Chapters 2 and 3 contain more detailed examples 
illustrating scaled processes and applications of weak convergence arguments based 
on Lemma C.4.1. Alternative scalings might be useful for lighter loadings. 



Appendix D 

Algorithms and Computations 



D.l Some Model and Parameter Calculations 
D.l.l Long Hyperperiod Models 

In Section 5.2.2, a response time CDF for the background aperiodic scheduling dis- 
cipline was developed for long hyperperiods. In that development, a point mass at 
xq defined the probability that a newly arriving task experienced no blocking delays 
incurred by periodic tasks. In our system, the actual response time distribution will 
be continuous. For the BGA Long Hyperperiod Model (LHM) computation, xo can 
be thought of as the point at which the response time behavior for values of r < x$ 
is defined by an M/M/l queue, and for values r > x 0 response times are defined by 
the LHM developed in Section 5.2.2. For this definition, xq satisfies 

jR m ( Xo ) = u/ 0 = 1 - e-<"»-** = Ri(x 0 ) = 1 - Pi (l - p 2 )' 1 

with a simple calculation giving 

-ln((p 1 )(l-ft)- 1 ) a 

xo decreases for fixed p\ and increasing p, which one would expect since the response 
time range for which the M/M/l applies will decrease as total traffic increases. Also, 
for fixed p, xo increases with decreasing p x which would also be expected since the 
M/M/l portion of the interval increases as pi decreases. 
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Using this definition for x 0) the resulting response time CDF is given in Equa- 
tion D.l. 



Ri{x) = < 



0 for x < 0 

1 - e - (w - Aa)x for 0 < x < x 0 

1 - (T^)(l - ^) for a* <x<B + x 0 

1 for x > B + xo 



(D.l) 



Except for very long hyperperiods our data suggests that this value of Xq errors on 
the side of being too small, especially when the system is moderately loaded. 

When deriving the LHM for aperiodic system size in Section 5.3.2, we assumed 
a value for n 0 , the aperiodic system size at the start of the hyperperiod. For consis- 
tency, we define n 0 as the aperiodic workload (x 0 ) divided by the mean processing 
time per aperiodic task (^J 1 ), or equivalently, n 0 = Xq^, which is also written as 

n ° = — (i^) — • 

Table D.l lists several values of n 0 for varying p2 and fixed pi. Like x 0 , n 0 
decreases for fixed p\ and increasing p, which one would expect since the response 
time range for which the M/M/l applies will decrease as total traffic increases. Also, 
for fixed p, n 0 increases with decreasing pi which would also be expected since the 
M/M/l portion of the interval increases as p\ decreases. 

To calculate x<) (and no) for the foreground aperiodic scheduling policy, one 
must first observe from the data either 5, or u)q where 

1 R 

d>0 = U)Q + (1 ~ CJ 0 )(1 - nrjf, \) = 1 



mi-pi)' H(i- P2 y 

and /3 is the mean blocking rate given some blocking occurs. Recall that in the FGA 
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p 


P2 


Pi 


N 2 


no 


0.99 


0.74 


0.25 


2.846 


0.0523 


0.85 


0.60 




1.500 


1.1750 


0.80 


0.55 




1.222 


1.3060 


0.99 


0.24 


0.75 


0.316 


0.0174 


0.85 


0.10 




0.111 


0.2030 


0.80 


0.05 




0.053 


0.2490 



Table D.l: Values for rz 0 in the LHM 

LHM, maximum blocking does not occur, and 0 can be derived using either heavy 
traffic theory or the DSM parameter. 

D.1.2 Response Times for Background Aperiodics 

Refer back to Section 5.2.4 for process and algorithm definitions. This is the (pro- 
cess) state variable simulation algorithm used to collect the region percentiles for the 
PWLM when estimating response times under the background aperiodic scheduling 
discipline. The algorithm was devised for intermediate hyperperiods, but also works 
reasonably well for short and long hyperperiods. 
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— function LPWM Estimation(H,C : in time) return (V,P*) : CDF; 

— This is process state simulation code to construct a CDF for the 

— BGA discipline; interemediate hyperperiod case. 

— constant declarations 

hcmin : counter := 500; acmin : counter := 1500; 

minp : time := min(C,H - C); maxp : time := max(C> H - C)\ 

— variable declarations 
hc,ac: counter := 0; 

tcurr >lcurr,tlmodH,tcmodH : time := 0; 
tcdivH, tldivH, k : natural := 0; 
iarr, svc : (random exponential) time := 0; 
Wa, vmax : time := 0; 
il,i2, vl : index := 1; 
begin 

— the hyperperiod count loop 

while (he < hcmin or ac < acmin) loop 
he he +1; 

— the aperiodic task count loop 
while (tcdivH < tldivH) loop 

ac := ac +1; 

iarr := exp(interar rival time); svc := exp(service time); 
Wa := max(0,Wa - (tcurr - tlast)) + svc; 
if Wa > vmax then vmax := Wa; end if; 
k := Wa div (H-C); 

tlast := tcurr; tlmodH := tcmodH; tldivH := tcdivH; 

tcurr := tcurr 4- iarr; tcdivH := tcurr div H; tcmodH := tcurr mod H; 

if (((tcmodH < minp) and (Wa < (k - 1)(H - C)+ tcmodH)) or 
((minp < tcmodH < maxp) and (Wa < (k - l)(H - C)+ minp)) or 
((maxp < tcmodH) and (Wa < (fc - l)(H - C)+ tcmodH - minp))) then 

il := 2k - 1; 

else il 2k; 

end if; 

P(il) P(il) + 1; if vmax = Wa then vl := il; end if; 
end while; 
while; 

— P = P/ac; Now calculate the return value (V,P*) using the steps in Section 5.2.3 
return (P*,V); 

end LPWM Estimation; 



Figure D.l: BGA IHM LPWM Estimation for Response Times Distributions 



Bibliography 



[1] Soren Asmussen, "Applied Probability and Queues", John Wiley & Sons, 1987 

[2] Ludwig Arnold, "Stochastic Differential Equations: Theory and Applications" , 
Krieger Publishing Company, 1992 (Reprint Edition); Original English Transla- 
tion Edition 1974 (John Wiley & Sons, Inc.) 

[3] Peter J. Bickel and Kjell A. Doksom, "Mathematical Statistics: Basic Ideas and 
Selected Topics", Holden Day, 1977 

[4] Patrick Billingsley, "Convergence of Probability Measures", J. Wiley & Sons, 
1968 

[5] David Y. Burman, "An Analytic Approach to Diffusion Approximations in 
Queueing," PhD Dissertation, Mathematics Department, New York University, 
February 1979 

[6] H. Chetto and M. Chetto, "Some Results of the Earliest Deadline Scheduling 
Algorithm", IEEE Transactions on Software Engineering, Vol. 15, No. 10, Oct. 
1989, pg. 1261-1269 

[7] David M. DeLong, "Crossing Probabilities for a Square Root Boundary by a 
Bessel Process", Commun. Statis.-Theor.Meth., A10(21), 1981, pg. 2197-2231 

[8] Jean-Dominique Deuschel and Daniel W. Stroock, "Large Deviations," Academic 
Press, 1984 

[9] M. D. Donsker, "Justification and Extension of Doob's Heuristic Approach to 
the Kolmogorov-Smirnov Limit Theorems," Ann. Math. Stat, 23, pg. 277-281, 
1952 

156 



BIBLIOGRAPHY 



157 



[10] J, L. Doob, "Heuristic Approach to the Kolmogorov-Smirnov Theorems/' Ann. 
Math. Stat, 20, pg. 393-403, 1949 

[11] Donald Gross and Carl M. Harris, "Fundamentals of Queueing Theory", Third 
Edition, John Wiley & Sons, 1998 

[12] Marion G. Harmon and T,P. Baker, "An Ada Implementation of Marsaglia's 
Universal Random Number Generator", Ada Letters, Volume VIII, Number 2, 
March/April 1988 

[13] M. J. Harrison, "Brownian models of queueing networks with heterogeneous cus- 
tomer populations" , Proceedings of the IMA Workshop on Stochastic Differential 
Systems, Springer- Verlag, 1988, pp. 147-186 

[14] M. J. Harrison, "Brownian Motion and Stochstic Flow Systems," reprint Robert 
E. Krieger Publishing Company, Inc., 1990; original John Wiley and Sons, Inc., 
1985 

[15] Donald L. Inglehart and Ward Whitt, "Multiple Channel Queues in Heavy Traf- 
fic J", Adv. AppL Prob., 2, 150-177-369, 1970(a) 

[16] Donald L. Inglehart and Ward Whitt, "Multiple Channel Queues in Heavy Traf- 
fic.II: Sequences, Networks, and Batches", Adv. AppL Prob., 2, 355-369, 1970(b) 

[17] N.K. Jaiswal, "Priority Queues," Academic Press, 1968 

[18] Daniel P. Johnson, "Diffusion Approximation for Optimal Filtering of Jump Pro- 
cesses and for Queueing Networks" , Ph.D. Thesis, Department of Mathematics, 
University of Wisconson, Madison, 1983 

[19] Samuel Karlin and Howard M. Taylor, "A First Course in Stochastic Processes", 
Academic Press, 1975 



BIBLIOGRAPHY 



158 



[20] Samuel Karlin and Howard M. Taylor, "A Second Course in Stochastic Pro- 
cesses" , Academic Press, 1981 

[21] Leonard Kleinrock, "Queueing Systems, Volume 1: Theory", John Wiley & Sons, 
1975 

[22] Leonard Kleinrock, "Queueing Systems, Volume 2: Computer Applications", 
John Wiley & Sons, 1976 

[23] John P. Lehoczky, Lui Sha, and Ye Ding, "The Rate Monotonic Scheduling 
Algorithm; Exact Characterization and Average Case Behavior", Proceedings of 
the 10 th Real-Time Systems Symposium, IEEE, December, 1989, pp. 166-171 

[24] John P. Lehoczky and Sandra Ramos-Thuel, "An Optimal Algorithm for 
Scheduling Soft- Aperiodic Tasks in Fixed-Priority Preemptive Systems" , Real- 
Time Systems Symposium, IEEE Proceedings, December 1992 

[25] John P. Lehoczky, "Real-Time Queueing Theory", Real-Time Systems Sympo- 
sium, IEEE Proceedings, December 1996 

[26] John P. Lehoczky, "Using Real-Time Queueing Theory to Control Lateness", 
ACM Sigmetrics Conference Proceedings, June 1997 

[27] CL. Lui and J. W. Leyland, "Scheduling Algorithms for Multiprogramming in 
a Hard Real Time Environment", Journal of the ACM 20(1), January 1973, pp 
46-61 

[28] G. Marsaglia, A. Zaman, W.W. Tsang, "Toward a Universal Random Number 
Generator", Statistics and Probability Letters, Volume 9, Number 1, pg 35-39, 
1990 



BIBLIOGRAPHY 



159 



[29] C. Park and F.J, Schuurmann, "Evaluations of Barrier-Crossing Probabilities of 
Wiener Paths", Journal of Appl Prc>6., 1976, pg. 267-275 

[30] Shivendra S. Panwar, Don Towsley, Jack K. Wolf, "Optimal Scheduling Policies 
for a Class of Queues with Customer Deadlines to the Beginning of Service", 
Journal of the Association for Computing Machinery, October 1988, Vol 35, No 
4, pp. 832-844 

[31] N.U. Prabhu, "Foundations of Queueing Theory" , Kluwer Academic Publishers, 
1997 

[32] Sheldon M. Ross, "Introduction to Probability Models", Academic Press, Sixth 
Edition, 1997 

[33] David Siegmund, "Boundary Crossing Probabilities and Statistical Applica- 
tions", Annals of Statistics, Vol 14, No. 2, 1986, pg 361-404 

[34] Robert Serfling, "Approximation Theorems of Mathematical Statistics", John 
Wiley k Sons, 1980 

[35] Lojos Takacs, "Introduction to the Theory of Queues" , Oxford University Press, 
1962 

[36] Hideaki Takagi, "Queueing Analysis: A Foundation of Performance Evaluation", 
Volume 1: Vacation and Priority Systems, Part 1, North-Holland, 1991 

[37] Ward Whitt, "Weak Convergence Theorems for Priority Queues: Preemptive- 
Resume Discipline", Journal of Applied Probability, Vol. 8, pp. 74-94, 1971 

[38] Ronald W. Wolff, "Stochastic Modeling and the Theory of Queues" , Prentice 
Hall, 1989 



